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TECHNICAL FIELD ..• ;i •■' ~' ' 

This invention relates to nucleic acid and amino acid sequences of receptors and associated . 
5 proteins and to the use of these sequences in the diagnosis, treatment, and prevention of neurological , , 

- -disorders; immu nologi^^ 
.disorders, including cancer.- - 

BACKGROUND OF THE INVENTION 

10 The term receptor describes a protein that specifically 

is broad and includes proteins with a variety of function. The ^ 

which bind" exliacdlular.ligands : and produce cellular respoiises.'in the areas of growth, differeittiatioii, - 
. endocytosis, and immune response. Other receptors facilitate the selective transport of proteins out of the 
endoplasmic reticulum and localize era ymes to particular location in the cell, PropagaUon of cellular = 
15 signals, and transport and localization of proteins, all rely upon specific interactions between receptors 
and* variety of asscx:iated protein. ; The term receptor may a^^ 

ligands with known or uttoown chemical composition and which interact with other cellular components. 
For example, the steroid hormone receptors bind to and.regulate transcription of DN A. 

Cell surface receptors are typically integral plasma membrane proteins. These receptors 
20 recognize hormones such as .catecholan^ 

peptide factors such as Ayrotropin-releasing hormone; galanin, somatostatin,, and tachykim^ 
circulatory system-borne signaling molecules. Cell surface receptors on immune system cells recognize 
antigens, antibodies, and major histocompatibility complex (MHC)-bound peptides. Other cell surface ■ 
receptors bind ligands to be internalized by thecell. This receptor-mediated endocytosis functions in the 
25 • uptake of low" density lipoproteins' (LDL), tfansferrin^glucose- or hiahnose-teri^nal glycoproteins, 
galactose-terminal glycoproteins, immunoglobulins, phosphovitellogenins, fibrin, proteinase-inhibitor 
" complexes, plasminogen activators, and thrombospondin (Lodish, H. et al. (1995) Molecular Cell 

Biology . Scientific American Books, New York NY, p. 723; and Mikhailenko, I. et al. (1997) J. Biol. 

Chem. 272:6784-6791). 
30 Signal transduction is the process of biochemical events by which cells are able to 

" '. • communicate with one another" and respond to extracellular signals. Extracellular signals are 

transduced through a biochemical cascade that begins with the binding of a signal molecule to a cell 
membrane receptor. The signal is propagated to effector molecules by intracellular signal transducing 
proteins and culminates with the activation of an intracellular target molecule. The process of signal . 
35 transduction regulates a wide variety of cell functions including cell proliferation, differentiation, and 



SUBSTITUTE SHEET (RULE 26) 

^BNSDOCtD- <WO 0107612A2J_> '* 



WO 01/07612 



PCT/US00/20035 



gene transcription. 

G-nrotein Coupled Receptors (GPCRs> 

< G-protein coupled receptors (GPCRs) are a class of molecules that participate in signal 

5 transduction in a variety of cell types. GPCRs are integral membrane proteins characterized by the 
presence of seven hydrophobic transmembrane domains which span the plasma membrane and form a 
bundle of antiparallel alpha (a) helices. These proteins range in size from under 400 to over 1000 amino 
acids (Strosberg, A.D. (1991) Eur. J. Biochem. 196:1-10; Coughlin, S.R. (1994) Curr. Opin. Cell Biol. 
6:191-197). The amino-terminus of the GPCR is extracellular, of variable length and often glycosylated; 
10 the carboxy-terminus is cytoplasmic and generally phosphorylated. Extracellular loops of the GPCR 
alternate with intracellular loops and link the transmembrane domains. The most conserved domains of 
GPCRs are the transmembrane domains and the first two cytoplasmic loops. The transmembrane 
domains account for structural and functional features of the receptor. In most cases, the bundle of a 
helices forms a binding pocket In addition, the extracellular N-terminal segment or one or more of the 
15 three extracellular loops may also participate in ligand binding. Ligand binding activates the receptor by 
inducing a conformational change in intracellular portions of the receptor. The activated receptor, in * \ 
turn, interacts with an intracellular heterotrimeric guanine nucleotide binding (G) protein complex which 
mediates further intracellular signaling activities, generally the production of second messengers such as 
cyclic AMP (cAMP), phospholipase C, inositol triphosphate, or interactions with ion channel proteins., 
20 (Baldwin, J.M. (1994) Curr. Opin. Cell Biol. 6:180-190; Watson, S. and S. Arkinstall (1994) The G- c 
protein Lin ked Receptor Facts Book . Academic Press, San Diego C A r pp ) HyHmiygic of hnnnH 
GTP by the G-protein completes the cycle, returning the G-protein to its inactive GDP-bound state. 

GPCRs include receptors for sensory signal mediators (e.g., light and olfactory stimulatory 
molecules); adenosine, bombesin, bradykinin, endothelin, y-aminobutyric acid (GABA), hepatocyte 
25 growth factor, luteinizing hormone (LH), thrombin, thyroid stimulating hormone (TSH), melanocortins, 
neuropeptide Y, opioid peptides, opsins, somatostatin, tachykinins, vasoactive intestinal polypeptide 
family, and vasopressin; biogenic amines (e.g., dopamine, epinephrine and norepinephrine, histamine, 
glutamate (metabotropic effect), acetylcholine (muscarinic effect), and serotonin); chemokines; lipid 
mediators of inflammation (e.g., prostaglandins and prostanoids, platelet activating factor, and 
30 leukotrienes); and peptide hormones (e.g., calcitonin, C5a anaphylatoxin, follicle-stimulating hormone 
(FSH), gonadotropic-releasing hormone (GnRH), neurokinin, and thyrotropin-releasing hormone (TRH), 
and oxytocin). GPCRs which act as receptors for stimuli that have yet to be identified are known as 
orphan receptors. For example, the TPRA40 protein is a GPCR isolated from mouse adipocytes and 
present in a number of mouse and human tissues, whose expression in adipose tissue is altered with aging 
35 and type 2 diabetes (Yang, H. (1999) Endocrinology 140:2859-2867). 
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' ■ GPCR mutations, which may cause loss of function or constitutive activation, have been • 
.' associated with numerous human diseases (Coughlm, supra). For instance, retinitis pigmentosa may 
arise from mutations in the rhodopsin gene.- Rhodopsin is the retinal photoreceptor which is located 
■ withihthe discs of the eye rod cell. Parma, J. et al. (1993, Nature 365:649-651) report that somatic 
5 activating mutations in the thyrotropin rcxeptor cause hyperfunctioning thyroid adenomas and suggest . 
V uSarcertain CPCRs^u 

I ofTSH receptor have been observed in brain tissue from Down syndrome and Alzheimer's disease 
patients/suggesting an apoptotic role for this receptbr in neurodegenerative disorders (Labudova, O. et 
al. (1999) Life Sci. 64:1037-1044). Many clinically relevant drugs act on GPCRs, including a and pV 

10 blockers which affect the activity of adrenergic receptors and are used in the treatment of hypertension 
and other cardiovascular disorders (W atson, supra, pp. 32-33). 

Receptors Involved in the Immun e System 

Examples of GPCRs implicated in inflammation and the immune response include the EGF 
15 module-containing, mucin-like hormone receptor (Emrl) and CD97 receptor proteins. These GPCRs 
are members of the recently characterized EGF-TM7 receptors family. These seven transmembrane 
hormone receptors exist as heterodimers invivo and contain between.three and seven: potential 
calcium-binding EGF-like motifs. CD97 is predominantly expressed in leukocytes, and is markedly 
upregulatedon activated B and T cells. (McKnight, A. j. and Gordon, S. (1998) J. Leukoc. Biol. 

20 63:271-280.) 

Irregularities in the GPCR signaling cascade may result in abnormal activation of leukocytes and 
lymphocytes, leading to the tissue damage and destruction seen in many inflammatory and autoimmune 
. diseases such as rheumatoid arthritis, biliary cirrhosis, hemolytic anemia, lupus erythematosus, and 
- thyroiditis. Abnormal cell proliferation, including cyclic AMP stimulation of brain, thyroid, adrenal, and 
25 gonadal tissue proliferation is regulated by G proteins (Meij, J.T. A. (1996) Mol. Cell: Biochem. 157:3 1- 
38; Aussel, C. et al. (1988) J. Immunol. 140:215-220). 

T cells play a dual role in the immune system as effectors and regulators, coupling antigen 
recognition with the transmission of signals that induce cell death in infected cells and stimulate other 
immune cells. Although T cells collectively recognize a wide range of different antigens, a clonal line of 
30 T cells can only recognize a single antigen. Moreover, the antigen must be presented to the T cell 

receptor (TCR) as a peptide complexed with a major histocompatibility molecule (MHC) on the surface 
of an antigen-presenting cell. The TCR on most T cells consists of two polypeptide subuhits, a and p, 
which are immunoglobuun-like integral membrane glycoproteins of similar molecular weight. The TCRa 
and TCRp subunits have an extracellular domain containing both variable and constant regions, a 
35 transmembrane domain that traverses the membrane once, and a short intracellular domain (Saito, H. et 
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al. (1 984) Nature 309:757-762). The genes for the TCR subunits are constructed through somatic 
rearrangement of different gene segments. Interaction of antigen in the proper MHC context with the 
TCR initiates signaling cascades that induce the proliferation, maturation, and function of cellular 
components of the immune system (Weiss, A. (1991) Annu. Rev. Genet. 25: 487 : 510). Rearrangements 
5 in TCR genes and alterations in TCR expression have been noted in lymphomas, leukemias, autoimmune 
disorders, and immunodeficiency disorders (Aisenberg, A.C. et al. (1985) N. Engl. J. Med 313:529-533; 
Olive, C. (1995) Immunol. Cell. Biol. 73:297-307; and Weiss, supra) . Immunizations with peptides 
derived from TCRs are effective treatment for some human T-cell-mediated autoimmune disease and in 
animal models of such illnesses, in particular, rheumatoid arthritis (Bridges, S.L. and Moreland, L.W. 
10 (1998) Rheum. Dis. Clin. North Am. 24:641-650). 

Tumor necrosis factor (TNF) is a pleiotropic cytokine that mediates immune regulation and 
inflammatory responses. The cellular responses triggered by TNF are initiated through its interaction 
with two distinct cell surface receptors, TNF- R 1 and TNF-R2. (Tartaglia, L.A. and Goeddel, D.V. 
(1992) Immunol. Today 13:151-153). Both TNF receptors are part of the TNF receptor (TNFR) 
15 superfamily, whose members include the Fas antigen, the p75 subunit of the NGF receptor, the TRAIL 
receptor, TRUNND, SalF19R, CD27, CD30, and CD40. Members of the TNFR superfamily share the 
TNFR/NGFR family cysteine-rich region signature, which consists of cysteine-rich pseudo-repeats in the 
extracellular domains. (ExPASy PROSITE document PDOC00561; Pan, G. et al. (1998) FEBS Lett. 
424:41-45; Bairoch, A. et al. (1997) Nucleic Acids Res. 25:217-221 ; and Smith, C.A. et al. (1994) Cell 
20 76:959-962). Polymorphisms in TNF-R2 are associated with systemic lupus erythematosus (Komata, T. 
et al. (1999) Tissue Antigens 53:527-533). In addition, increased serum concentrations of soluble TNF- 
Rl have been observed in some patients with advanced gastric or colorectal cancer (Shibata, M. et al. 
(1998) Surg. Today 28:884-888). 

Another essential component of the immune respoase is the complement system, which responds 
25 to signals provided by antigen recognition by mobilizing effector activities including inflammation, 
phagocytosis, and cell lysis. Receptors on macrophages and neutrophils bind activated complement C3 
on the surface of foreign particles such as bacteria,, thus targeting the foreign particles for phagocytosis - 
and destruction by lysosomal enzymes. Complement receptor 1 (CR1) has a wide cellular/tissue 
distribution, and mediates enhancement of phagocytosis, induction of IL-1 secretion and enhancement of 
30 B-cell differentiation. Defective expression of CR1 is associated with the autoimmune disease systemic 
lupus erythematosis. (Carroll, M.C. (1998) Annu. Rev. Immunol. 16:545-568.) 

Nuclear Receptors 

The nuclear receptors are another receptor family, and includes the retinoic acid receptors , 
35 (RARs) and the retinoid X receptors (RXRs). RARs and RXRs can form heterodimers which are - 
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thought to have a signal transduction function: Retinoic acid (RA) is a biologically active metabolite of 
vitamin A (retinol), a fat-soluble vitamin found mainly in fish liver oils, liver, egg yolk, butter, and 
cream. Retinol cannot be synthesized in vivo and must be obtained from the diet. Retinol, R A, and other 
retinoids influence epithelial cell differentiation. A number pf carrier proteins which bind retinol or other 
5 retinoids have been identified. These retinoid binding proteins (RBPs) appear to direct bound retinoid 

t ~ 'molecules to specific metabolic pathwaysr-Specific receptors^for RBPs mediate the cellular uptake of- - 
retinoids and the transfer of retinoids to intracellular RBPs (Suhdaram, M. et al. (1999) J. Biol. Chem. 

' 273:3336-3342); ' r:v ' -. : \ r " r:y.y^r^ - vr. - - - . - ■■•■^■^ - 

10 Low Molecular Weight (LMW) G-proteins 

Low molecular weight (LMW) G-proteins regulate cell growth, cell cycle control, protein 
secretion, and intracellular vesicle interaction. They consist of single polypeptides which are able to bind 
to and hydrolyze GTP, thus cycling between an inactive and an aictive state. LMW G-proteins respond to 
extracellular signals from receptors and activating proteins by transducing mitogenic signals involved in 

15 various cell liinctions. The binding and hydrolysis of GTP regulates the response of LMW G-proteins 
and acts as an energy source during this process (Bokoch, G. M. and Da*, C. J. (1993) FASEB J. 

: 7:750-759). '; - " . . - : .- \ / / ; ' \ . 

At least sixty members of thQ LMW G-protein superfamily have been identified and are currently 
grouped into the ras, rho, arf, sari, ran, and rab subfamilies. Activated ras genes were initially found in 

20 human cancers and subsequent /studies confirmed that ras function is critical to receptor tyrosine kinase- 
mediated signal transduction pathways that determine.whether ^ cells continue to grow and divide, or 
whether they differentiate. Rho G-proteins control signal transduction pathways that link growth 
receptors to actin polymerization, which is necessary for normal cellular growth and division. The rab, 
arf, and sar 1 families of proteins control the translocation of vesicles to and from membranes for protein 

25 localization, protein processing, and secretion. Ran G-proteins are located in the nucleus and have a key 
role in nuclear protein import, the control of DNA synthesis, and cell-cycle progression (Hall, A (1990) 
Science 249:635-640; Barbacid, M. (i 987) Ann. Rev Biochem. 56:779-827; and Sasaki, T. andTakai, 
Y. (1998) Biochem. Biophys. Res. Commun. 245:641-645). 

LMW G-prbt'ei hs are GTPases which cycle between the active GTP-bound and inactive 

30 GDP-bound forms. At least three types of proteins regulate this process: GTPase T activating proteins, 
(GAP), which stimulate GTP hydrolysis by the LMW G-protein; guanine nucleotide exchange factors 
(GEP), which facilitate the exchange of GDP bound to the LMW G-protein for GTP; and guanine 
nucleotide dissociation inhibitors (GDI), which inhibit this reaction (Ikeda, M. et al. (1998) J. Biol. 
Chem. 273:814-821; Quilliam, L. A; (1995) Bioessays 17:395-404). The best characterized GEP is the 

35 mammalian homologue of the Drosbphila Son-of-Sevenless protein. Both GEP and GAP activity may be 
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affected by extracellular stimuli and modified by accessory proteins such as RalBPl and POB1. Mutant 
Ras-family proteins, which bind but can not hydrolyze GTP, are permanenUy activated, and cause cell 
proliferation or cancer, as do GEP that activate LMW G-proteins (Drivas, G. T. et al. (1990) Mol. Cell. 
Biol. 10:1793-1798; and Whitehead, I. P, et al. (1998) Mol Cell Biol. 18:4689-4697). , 

5 _ . "... . . . . . . 

OlfactoixCPCRs 

Another large subfamily of GPCRs are the olfactory receptors. These receptors share the seven 
hydrophobic transmembrane domains of other GPCRs and function by registering G protein T mediated 
transduction of odorant signals. Numerous distinct olfactory receptors are required to distinguish 
10 different odors. Each olfactory sensory neuron expresses only one type of olfactory receptor, and distinct 
spatial zones of neurons expressing distinct receptors are found in nasal pasages. 

The discovery of new receptors and associated proteins and the polynucleotides encoding them 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention, 
and treatment of cell proliferative, autoimmune/inflammatory, and neurological disorders. 

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, receptors and associated proteins, referred to 
collectively as "RECAP" and individually as "RECAP- l- f " "RECAP-2," "RECAP-3," "RECAP-4," : 
"RECAP-5," "RECAP-6," "RECAP-7," "RECAP-8," "RECAP-9," "RECAP-10," "RECAP-1 1," 

20 "RECAP-12," "RECAP-13," "RECAP- 14," "RECAP-15," "RECAP-1 6," "RECAP-17," "RECAP-18," ; 
"RECAP-19," "RECAP-20," "RECAP-21," and "RECAP-22," In one aspect, the invention provides an 
isolated polypeptide comprising an amino acid sequence selected from the group consisting of a) an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-22, b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 

25 group coasisting of SEQ ID NO: 1-22, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1 -22, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-22. In one alternative, the invention 
provides an isolated polypeptide comprising the amino acid sequence of SEQ ID NO:l-22. 

The invention further provides an isolated polynucleotide encoding a polypeptide comprising an 

30 amino acid sequence selected from the group consisting of a) an amino acid sequence selected from the 
group consisting of SEQ ID NO:l -22, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, c) a 
biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-22, and d) an immunogenic fragment of an amino acid sequence selected from the group consisting 

35 of SEQ ID NO: 1-22. In one alternative, the polynucleotide encodes a polypeptide selected from the 
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^^upconsisUngofSEQIDNO:^ 

group consisting of SEQ ID NO:23-44. ; • 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 

-sequer^^^^ 

5 selected from the group consisting of a) an amino acid sequence selected from the group .pnsisUng of 
SEQ ID NO' 1 -22, b)"a~naturally occurring amino-aGid sequence haying, at least 90%. sequence identity to 
an amino acid sequence selected from the group cor^isUng of SEQ ID NO: 1-22/c) a biologically acUve 

immunogenic fragment of an amino acid sequence selected from thegroup consisUngof SEQ ID NO:l- 
10 22 Inonealternative^in^ V 
another alternati^^ V 
polynucleotide. 

The invention also provides a method for producing a polypeptide comprising an amino acid 
sequence selected from the group ^ consisUng of a) an amino acid sequent selected ^ ; . 

15 consisting of SEQ ID NO:l-22, b) a naturally occurring amino acid sequence having at least 90% 

sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:l-22. c) a 
biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NOl-22 and d) an immunogenic fragment of an amino acid sequence selected from the group consisting 
of SEQ ID NO l-22. The method comprises a) culturing a cell under conditions suitable for express.on 
20 of the polypeptide, wherein said cell is transformed with a recombinant polynucleotide comprising a 
promoter sequence operably linked to a polynucleotide enccxiing the pol^^ 
polypeptide so expressed. 

•Additionally, tte invention provides an isolated antibody which specifically binds to a- 
polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid 
25 sequence selected from the group consisting of SEQ ID NO:l -22, b) a naturally occurring amino acid 
• sequence having at least 90% sequence identity to an amino acid sequence selected from the group 

consisting of SEQ ID NO:l -22, c) a biologically active fragment of an amino acid sequence selected from 
the group consisting of SEQ ID NO:l -22, and d) an immunogenic fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO:l -22. 
30 The invention further provides an isolated polynucleotide comprising a polynucleotide sequence 

selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:23-44,b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b). and e) 
35 an RNA equivalent of a)-d). In one alternative, the polynucleotide comprises at least 60 contiguous 
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nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
5 SEQ ID NO:23-44, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44 f c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
an RNA equivalent of a)-d). The method comprises a) hybridizing the sample with a probe comprising at 
least 20 contiguous nucleotides comprising a sequence complementary to said target polynucleotide in the 

10 sample, and which probe specifically hybridizes to said target polynucleotide, under conditions whereby a 
hybridization complex is formed between said probe and said target polynucleotide or fragments thereof, 
and b) detecting the presence or absence of said hybridization complex, and optionally, if present, the 
amount thereof. In one alternative, the probe comprises at least 60 contiguous nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, said 

15 target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence 

selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:23-44, b) a naturally occurring polynucleotide sequence having at least 70% sequence , 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, c) a . 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 

20 an RNA equivalent of a)-d). The method comprises a) amplifying said target polynucleotide or fragment 
thereof using polymerase chain reaction amplification, and b) detecting the presence or absence of said 
amplified target polynucleotide or fragment thereof, and, optionally, if present, the amount thereof. 

The invention further provides a pharmaceutical composition comprising an effective amount of 
a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid 

25 sequence selected from the group consisting of SEQ ID NO: 1-22, b) a naturally occurring amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-22, c) a biologically active fragment of an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-22, and d) an immunogenic fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-22, and a pharmaceutically acceptable excipient In 

30 one embodiment, the pharmaceutical composition comprises an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-22. The invention additionally provides a method of treating a disease 
or condition associated with decreased expression of functional RECAP, comprising administering to a 
patient in need of such treatment the pharmaceutical composition. 

The invention also provides a method for screening a compound for effectiveness as an agonist 

35 of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
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acid sequence selected from the group consisUng of SEQ ID NO:l-22. b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an aniino acid sequence selected from the group 
• consisting of SEQ ID NO: 1-22, c) a biologically acUve fragment- of ah aniino acid sequence selected from 
■ the group consisting of SEQ ID NO: 1-22, and d) an immunogenic fragment of an amino acid sequence 
5 selected from the group consisUng of SEQ ID Nb:l-22. The method comprises a) exposing a sample 
- comprising the polypepUde to a compound, and b) detecUng agonist.activity in the samplej Jn one _ ... _. 
alternative, the invention provides a pharmaceutical composition comprising an agonist compound 
' " identified by the method and a pharmaceutically acceptable excipient. In another alternative, the 
invention provides a method of treating a disease or condition associated with decreased expression of 
10 functional RECAP, comprising administering to a patient in need of such treatment the pharmaceutical 

; composition. . : 

Additionally, the invention provides a method for screening a compound for effectiveness as an 
antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO:l-22, b). a naturally occurring 
15 amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group-consisting of SEQ ID NO:l -22, c): a biologically active frag 

selected from the group consisting of SEQ ID NO: 1-22, and d) an immunogenje fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1 -22. The method comprises*) 
exposing a sample comprising the polypeptide to a compound, and b) detecting antagonist a0vity in 

20 the sample.. In one alternative, the invention provides a pharmaceutical composition comprising an 
antagonist compound identified by the method and a pharmaceutically acceptable excipient. In another 
alternative, the invention provides a method of treating a disease or condition associated with- 
overexpression of functional RECAP, comprising administering to a patientin need of such treatment 
the pharmaceutical composition. 

25 . The invention further provides a method of screening for a compound that specifically binds to 

a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-22, b) a naturally occurring amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of. SEQ ID NO:l-22, c) a biologically active fragment of an amino acid sequence selected from 

30 the group consisting of SEQ ID NO:l-22, and d) an immunogenic fragment of an amino acid sequence 
selected from the group consisting of SEQ ID N0:l-22. The method comprises a) combining the 
polypeptide with at least one test compound under suitable conditions, and b) detecting binding of the 
polypeptide to the test compound, thereby identifying a compound that specifically binds to the. 

polypeptide. . ■• - ;- : ■.. ; - . •-. 

35 The invention further provides a method of screening for a compound that modulates the 
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activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from. the group consisting of SEQ ID NO:l-22, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-22 t c) a biologically active fragment of an amino 
5 acid sequence selected from the group consisting of SEQ ID NO:l-22, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22. The 
method comprises a) combining the polypeptide with at least one test compound under conditions 
permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the 
presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the 

10 test compound with the activity of the polypeptide in the absence of the test compound, wherein a t 
change in the activity of the polypeptide in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in altering 
expression of a target polynucleotide, wherein said target polynucleotide comprises a sequence selected 

15 from the group consisting of SEQ ID NO:23-44, the method comprising a) exposing a sample 

comprising the target polynucleotide to a compound, and b) detecting altered expression of the target 
polynucleotide. 

The invention further provides a method for assessing toxicity of a test compound, said method 
comprising a) treating a biological sample containing nucleic acids with the test compound; b) 

20 hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:23- 
44, ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, iii) a polynucleotide 

25 sequence complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA 
equivalent of i)-iv). Hybridization occurs under conditions whereby a specific hybridization complex is 
formed between said probe and a target polynucleotide in the biological sample, said target 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:23-44, ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 

30 polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, iii) a polynucleotide 
sequence complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA 
equivalent of i)-iv). Alternatively, the target polynucleotide comprises a fragment of the above 
polynucleotide sequence; c) quantifying the amount of hybridization complex; and d) comparing the 
amount of hybridization complex in the treated biological sample with the amount of hybridization 

35 complex in an untreated biological sample, wherein a difference in the amount of hybridization 
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complex in the treated biological sample is indcadve of toxicity of the test compound. 

= ■) BRIEF DESCRIPTION OF THE TABLES ^ ■ / 

r Table 1 shows polypeptide arid nucleotide sequence identification numbers (SEQ ID NOs), clone 
5 identification numbers (clone IDs), cDNA libraries, andcDNA fragments used to assemble full-length 
^'■''"""Sequences'encoding-REGAP.'-'^;;- 1 - -'-Vr -';-\f ~":'";t. ."" ~ — : - - - - --• • •• 

1 ' ■ ■ Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and meuiols. algorithjrK, and searchable databa used for analysis of RECAP. - . - - 
• Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific expression 
10 patterns of each nucleic acid sequence as determined by northern analysis; diseases, disorders, or 
conditions associated with these tissues; and the vector into which each cDNA was cloned, 
v ; Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
encoding RECAP were isolated. 

Table 5 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
15 polypeptides of the invention, along with applicable descriptions, refCTences, and threshold parameters. 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
thai this invention is not limited to the particular machines, materials and methods described, as these 
20 may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
-./ particular embodiments only, and is not intended to limit the scope of the present invention which will be 
limited only by the appended claims. 

. ) It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
and "the" include plural reference unless the context ^ clearly dictates otherwise. Thus, for example, a 
25 reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents mereof knov^ to those sldlled in the art, and so for^ 

Unless defined otherwise, all technical and scientific terms used herein have the same meanings 
as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any 
machines, materials, and methods similar or equivalent to those described herein can be used to practice 
30 or test the present invention, the preferred machines, materials and methods are now described. All 
publications mentioned herein are cited for the purpose of describing and disclosing the cell lines, 
" • protocols, teageris.aW.vectors which are reported in. the pubBcations and which might.be used in _ 
connection with the invention: Nothing herein is to be construed as an admission that the invention is not 
entitled to' antedate such disclosure by virtue of prior invention. 
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DEFINITIONS 

"RECAP" refers to the amino acid sequences of substantially purified RECAP obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and , 
human, and from any source, whether natural, synthetic, semi -synthetic, or recombinant. 
5 The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 

RECAP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of RECAP either by directly interacting with 
RECAP or by acting on components of the biological pathway in which RECAP participates. 

An "allelic variant" is an alternative form of the gene encoding RECAP. Allelic variants may 
10 result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. Each 
of these types of changes may occur alone, or in combination with the others, one or more times in a 
15 given sequence. 

"Altered" nucleic acid sequences encoding RECAP include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as RECAP or a 
polypeptide with at least one functional characteristic of RECAP. Included within this definition aret 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 
20 the polynucleotide encoding RECAP, and improper or unexpected hybridization to allelic variants, with a 
locus other than the normal chromosomal locus for the polynucleotide sequence encoding RECAP. The 
encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of amino 
acid residues which produce a silent change and result in a functionally equivalent RECAP. Deliberate 
amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, . ~ 
25 hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological or 
immunological activity of RECAP is retained. For example, negatively charged amino acids may 
include aspartic acid and glutamic acid, and positively charged amino acids may include lysine and 
arginine. Amino acids with uncharged polar side chains having similar hydrophilicity values may . 
include: asparagine and glutamine; and serine and threonine. Amino acids with uncharged side chains 
30 having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; 
and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring protein 
35 molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence to the 
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: complete naUve amino acid sequence associated with the recited protein molecule. * : V : 

."Amplification" relates to the production of additional copies of a nucleic acid sequence. 
\ .Amplification is'generally carried : out using 'polymerase chain reaction (PCR) technologies' weli;knowri in [ 

■ . ' ; the art. v ■-. • - " ' / Y ''7 . . ' ■',*/-■'• 

5 The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of 

- _ -RECAP; Antagonists may include proteins such as antibodies, nucleic aicids, carbohydrates t ;s mall _ : 

molecules, or any other compound or composition which modulates the activity of REC AP either by 
"~ 7 : direcfly interacting with REG AP of by acUng on components of the biological pathway in which RECAP 

participates. : • . 

10 The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof, 

such as Fab, F(ab') 2 , and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind RECAP polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. Hie polypeptide or oligopeptide used to 
immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or 

15 synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers 
that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole limpet 
hemocyanin (KLH). The coupled peptide is then used to immunize the animal. 

, The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that makes 
contact with a particular antibody. When a protein or a fragment of a protein is used to immunize a host 

20 animal, numerous regions of the protein may induce the production of antibodies which bind specifically 
to antigenic determinants (particular regions or three-dimensional structures on the protein). An antigenic 
determinant may compete with the intact antigen (i.e., the immunogen used to elicit the immune response) 
for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" (coding) 

25 strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; peptide , . / 
■ J nucleic acid (PNA); oligonucleotides having mckiified backbone Unkages such as phosphorothioates, 

methylphosphonates, or benzylphosphonates; oligonucleotides having modified sugar groups such as 2*- 
methoxyethyl sugars or T-methoxyethoxy sugars; or oligonucleotides having modified bases such as 5- 
methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine v Antisense molecules may be produced by 

30 any method including chemical synthesis or transcription. Once introduced into a cell, the 

complementary antisense molecule base-pairs with a naturally occurring nucleic acid sequence produced 

V by the cell to form duplexes which block either transcription or translation. The designation "negative" 
or "minus" can refer to the antisense strand, and the designation "positive" or "plus" can refer to. the 
sense strand of a reference DNA molecule. 

35 The term "biologically active" refers to a protein having structural, regulatory, or biochemical 

' •■ ' ■ " • ' ■ -13' . : . . " . 
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functions of a naturally occurring molecule. ■ Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic RECAP, or of any oligopeptide thereof, 
to induce a specific immune response in appropriate animals or cells and to bind with specific antibodies. 
"Complementary" describes the relationship between two single-stranded nucleic acid sequences 
5 that anneal by base-pairing. For example, 5-AGT-3' pairs with its complement, 

3'-TCA-5\ ' • . . . • . .... • 

A "composition comprising a given polynucleotide sequence" and a "composition comprising a 
given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 
amino acid sequence. The composition may comprise a dry formulation or an aqueous solutioa 
10 Compositions comprising polynucleotide sequences encoding RECAP or fragments of RECAP may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be associated 
with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be deployed in an 
aqueous solution containing salts (e.g., NaCl), detergents (e:g., sodium dodecyl sulfate; SDS), and other 
components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 
15 "Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 

DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (PE Biosystems, 
Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from one 
or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for fragment 
assembly, such as the GEL VIEW fragment assembly system (GCG, ; Madison WI) or Phrap (University 
20 of Washington, Seattle WA). Some sequences have been both extended and assembled to produce the 
consensus sequence. : 
"Conservative amino acid substitutions" are those substitutions that are predicted to least ' : 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 
protein is conserved and not significantly changed by such substitutions, the table below shows amino 
25 acids which may be substituted for an original amino acid in a protein and which are regarded as 
conservative amino acid substitutions. - 



30 



35 



Original Residue 


Conservative Substitution 


Ala 


Gly, Ser 


Arg 


His, Lys 


Asn 


Asp, Gin, His 


Asp 


Asn, Glu 


Cys 


Ala, Ser 


Gin 


Asn, Glu, His 


Glu 


Asp, Gin, His 


Gly 


Ala 


His 


Asn, Arg, Gin, Glu 


He 


Leu, Val 


Leu 


lie; Val 
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Arg, Gin, Glu V 
• Leu, He . 

His, Met, Leu, Tip, Tyr * t 

sct " Cys;Thr ' ' 

5 - Thr . , • •- .7 ' ■ Ser, Val . : - ' '■ - - - '•. : ; 

Tip ; PhcTyr , 

Tyr His, Phe, Trp 

■__ _ - _ ■ _ ^; _ jv a i _ _ ; J He, Leu, Thr _ 

10 Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 

backbone in the area of the substitution, for.example, as a beta sheet or alpha helical conformation, . 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the 
side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
15 absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. Chemical 
modifications of a polynucleotide sequence can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxy!,, or amino group. A derivative polynucleotide encodes a polypeptide which retains, 
; at least one biological or immunological function of the natural molecule. A derivative polypeptide is one 
20 modified by glycosylation, pegylation, or any similar process that retains at least one biological or 
immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

A "fragment" is a unique portion of RECAP or the polynucleotide encoding RECAP which is 
25 identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up to 
: the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15, 
16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or. at least 500 contiguous nucleotides or amino acid 
30 residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain 
defined sequence. Clearly these lengths are exemplary, 'and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures Vl may be encompassed by the present 

35 embodiments. / 

A fragment of SEQ ID NO:23-44 comprises a region of unique polynucleotide sequence that 
specifically identifies SEQ ID NO:23-44 N , for example, as distinct from any pther sequence in the; ; 

j * _ _ _ . _ ; _ __l?L: . ^ 
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genome from which the fragment was obtained. A fragment of SEQ ID NO:23-44 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:23-44 from related polynucleotide sequences. The precise length of a fragment of SEQ ID 
NO:23-44 and the region of SEQ ID NO:23-44 to which the fragment corresponds are routinely 
5 determinable by one of ordinary skill in the art based on the intended purpose for the fragment 

A fragment of SEQ ID NO: 1-22 is encoded by a fragment of SEQ ID NO:23-44. A fragment 
of SEQ ID NO:l-22 comprises a region of unique amino acid sequence that specifically identifies SEQ 
ID NO: 1 -22. For example, a fragment of SEQ ID NO: 1 -22 is useful as an immunogenic peptide for the 
development of antibodies that specifically recognize SEQ ID NO: 1-22. The precise length of a 
10 fragment of SEQ ID NO: 1 -22 and the region of SEQ ID NO: 1 -22 to which the fragment corresponds 
are routinely determinable by one of ordinary skill in the art based on the intended purpose for the 
fragment 

A "full-length" polynucleotide sequence is one containing at least a translation initiation codon 
(e.g., methionine) followed by an open reading frame and a translation termination codon. A "full- 
15 length" polynucleotide sequence encodes a "full-length" polypeptide sequence. 

. "Homology" refers to sequence similarity or, interchangeably, sequence identity, between two or 
more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to 
the percentage of residue matches between at least two polynucleotide sequences aligned using a 
20 standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3.12e sequence 
25 alignment program. This program is part of the LASERGENE software package, a suite of molecular 
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 
and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. For 
pairwise alignments of polynucleotide sequences, the default parameters are set as follows: Ktuple=2, 
gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is selected as 
30 the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between aligned 
polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is 
provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search 
Tool (BLAST) (Altschul, S.F. et al. (1990) J. MoI. Biol. 215:403-410), which is available from several 
35 sources, including the NCBI, Bethesda, MD, and on the Internet at 
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; http://www.ncbh includes various sequence analysis 

programs including "blastn," that is used to align a known pplynucl^tide sequence with other 
< ' ' ■; polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
5^Sequences" can be accessed and used interactively at http://www.ncbi. nj^^ The 
~ "BLAST 2 Sequences" tool can be~used for both blastn and blastp (cyscussed below). BLAST programs 
are commonly used with gap and other parameters set to default settings. For example, to compare two 
nucleotide sequences, one may use blastn with'the "BLAST 2 Sequences" tool Version 2.0.12 (April-21- 
2000) set at default parameters. Such default parameters maybe, for example: 
10 Matrix: BLOSUM62 \ , 

Reward for match: I .■ '. '• ... • • .' 

Penalty for mismatch: -2 _ 
Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 t 
15 Expect: 10 . ■ • . 

; " Word Size: 11 I. : \ ' 

" Filter: on \ \ 

Percent identity may be measured over the length of an entire defined sequence, for example, as 
defined by a particular SEQ ID number, or may be measured over a shorter. length, for example, over the 
20 length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at least 
30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such lengths 
are exemplary only, and it is understood that any fragment length supported by : the sequences shown 
herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which percentage 

identity may be measured . : : 

25 Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 

similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in a 
nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences that 
all encode substantially the same proteia ; , 

The phrases "percent identity" and 4< % identity," as applied to polypeptide sequences, refer to the 
30 percentage of residue matches between at least two polypeptide sequences aligned using a standardized 
algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment methods take 
into account conservative amino acid substitutions. Such conservative substitutions, explained in more 
detail above, generally preserve the charge and hydrophobicity at the site of substitution, thus preserving 
the structure (and therefore function) of the polypeptide. . 
35 . Percent identity between polypeptide sequences may be determined using the default parameters 
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of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3.1 2e sequence alignment 
program (described and referenced above). For pairwise alignments of polypeptide sequences using 
CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap penalty=3, window=5, and 
"diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with 
5 polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity" 
between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.12 
(Apr-21 -2000) with blastp set at default parameters. Such default parameters may be, for example: 
10 Matrix: BLOSUM62 

Open Gap: 1 1 and Extension Gap: 1 penalties 

Gap x drop-off: 50 

Expect: 10 

Word Size: 3 

15 Filter: on , 

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, a 
fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
20 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (H ACs) are linear microchromosomes which may contain DNA 
sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for chromosome 
25 replication, segregation and maintenance. 

, ' .. The term "humanized antibody" refers to an antibody molecule in which the amino acid sequence 
in the non-antigen binding regions has been altered so that the antibody more closely resembles a human 
antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
30 complementary strand through base pairing under defined hybridization conditions. Specific 
. hybridization is an indication that two nucleic acid sequences share a high degree of complementarity, 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized after 
the "washing" step(s). The washing step(s) is particularly important in determining the stringency of the 
hybridization process, with more stringent conditions allowing less non-specific binding, i.e., binding 
35 between pairs of nucleic acid strands that are not perfectly matched. Permissive conditions for annealing 
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> of nudcic acid sequences are routinely determinable by one of ordinary skill in the art and may be 
consistent among hybridizauo*^^^ 

to achieve the desired stringency; and therefore hybridization specificity. Permissive annealing conditions , 

occur, for example;^ 
5 pg/ml sheared, denatured salmon sperm DNA. 

- ^erierallyrstringencyorhybridizadonisexpr^ 

under which the wash step is carried out Such wash temperatures are typically selected to be about 5<C 
to 20°C lower Wn the thermal melting point (TJ for the specific sequence at a defined ionic strength and 
pH TheT is the temperature (under defined ionic strength and pH) at which 50% of the target 
10 scquencchy^ridizes to a perfectly matched probe, Ah equation for calculating T m and. amditions for 
nucleic acid hybridization are well known and can be found in Sambrook, J. et al,, 1989, M<*-]ar 
^ n A T 3 hor a torv Manual. 2* ed.. vol. 1-3. Cold Spring Harbor Press, Plainyiew NY; specifically, 
see volume 2, chapter 9. 

• ^stringency' conditions for hybridization between polynucleotides of ^present ihvention :. -. 

15 include wash con^^^^ 

Alternatively, temperatures of about 65»C, 60°C 55°C, or 42°C may be used. SSC concentration may 

- be varied from about 0.1 to 2 x SSC, with SDS being present at about 0. 1%. Typically, blocking 
reagents- a* usedmWock non-specific hybridization. Such blocking-reagents include, for JnsMmce, , t , 
sheared and denatured salmon sperm DNA at about 100-200 ug/ml. Organic solvent, such as formam.de 

20 at a concentration of about 35-50% v/v, may also be used under particular circumstances, such as for 
RNA:DNA hybridizations. Useful variations on these wash conditions will be readily apparent to those 

v 0 f ordinary skill in the art Hybridization, particularly under high suingency conations, may be 

suggestiveof evolutionary similarity between the nucleotides. Such similarity is strongly indicative of a 
similar role for the nucleotides and their encoded polypeptides. 

25 The term hybridization complex" refers to a com^^ 

sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization 
complex may be formed in solution (e.g., Cot or R 0 t analysis) or formed between one nucleic acid: 
sequence present in solution and another nucleic ad 

paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells or 
30 their nucleic acids have been fixed). 

Tre Wds'-insertion".^ «addition- rcfer-td changes in an aminb.acid or nuclec4i^se^ence . , • 
resulting in the addition of one or more ainino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
. disorders; or infectious or genetic disease^ ^ 

35 various facte e.g.. cytokines, chemokines, and other signaling molecules, which may affect cellular and 



19 

; SUBSTITUTE SHEET (RULE 26) 

BNSDOCID^WO 0107612A2_L> . ' 



WO 01/07612 



PCT/US00/20035 



systemic defense systems. ..... ........... 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of RECAP which is 
capable of eliciting an immune response when introduced into a living organism, for example, a mammal. 
The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of RECAP 
5 which is useful in any of the antibody production methods disclosed herein or known in the art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, polypeptides, 

or other chemical compounds on a substrate. 

The terms "element" and "array element", refer to a polynucleotide, polypeptide, or other- 
chemical compound having a unique and defined position on a microarray. 

10 The term "modulate" refers to a change in the activity of RECAP. . For example, modulation may 

cause an increase or a decrease in protein activity, binding characteristics,. or any other biological, 
functional, or immunological properties of RECAP. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
15 synthetic origin which may be single-stranded or double- stranded and may represent the sense or the 

anusense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. ^ 

, "Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably ® 
linked to a coding sequence if the promoter affects the transcription or expression of the coding t| 
20 sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where | 
necessary to join two protein coding regions, in the same reading frame. > 1 

; "Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which comprises j f I 

an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of amino acid 
residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs preferentially 
25 bind complementary single stranded DNA or RNA and stop traascript elongation, and may be pegylated 
to extend their lifespan in the cell. 

"Post-translational modification" of an RECAP may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
art. These processes may occur synthetically or bipchernically. Biochemical modifications will vary by 
30 cell type depending on the enzymatic milieu of RECAP. 

"Probe" refers to nucleic acid sequences encoding RECAP, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are isolated 
oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical labels 
include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" arc short nucleic 
35 acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
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complemcntarybase-pairing. The primer may then be extended along the target DNA strand by a DNA 
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid 
. sequence, e.g., by the polymerase chain reaction (PCR). 

Probes' and primers as used in the present invention topically, comprise' at least '15 contiguous 
5 nucleotides of a known sequence/ In order to enhance specificity, longer probes and primers may also be 
v'- employed, /such as probes and primers that comprise at least 20,' 25 .jO, 40, 50, 6^ 

•least 150 consecutive nucleotides of the disclosed nucleic add sequences^ Probes and primers may be 
considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used! 
10 . Methods for preparing and using probes and primers are described in the references, for example 
Sambrook, J. et al., 1989, Molecular Clon ing- A laboratory Manual. 2 nd ed, vol. 1-3, Cold Spring 
Harbor Press, Plairiview NY; Ausubel, F.M. et al.,1987, Current Protocols in Molecular Biology, Greene 
Publ. Assoc. & Wiley-Intersciences, New York ! NY; Innis, M. et al., 1990, PCR Protocols, A Guide to 
MHhnd. and Applications . Academic Press, San Diego CA. PCR primer pairs can be derived from a 
15 known sequence, for example, by using computer programs intended for that purpose such as Primer 
(Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA). 

Oligonucleotides for use as primers are selected using software known in the artforsuch 
purpose. For example. OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
20 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas South 
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences and 
." . . is thus useful for designing primers on a.genoIhe:wide^scope.•' The Primer3 primer selectic* program 
25 (available to the public from the Whitehead Institute/MIT Center for Genome Research, Cambridge MA) 
allows the user to input a "mispriming library." in which sequences to avoid as primer binding sites are 
user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for microarrays. 
(The source code for the latter two primer selection programs may also be obtained from their respective 
sources and modified to meet the user's specific needs.) The PrimeGen program (available to the public 
30 from the UK Human Genome Mapping Project Resource Centre, Cambridge UK) designs primers based 
on multiple sequence alignments, thereby allowing selection of primers that hybridize to either the most 
conserved or least conserved regions of aligned nucleic acid sequences. Hence, this program is useful for 
identification of both unique and conserved oligonucleotides and polynucleotide fragments. The . 
oligonucleotides and polynucleotide fragments identified by any of the above selection methods are useful 
35 in hybridization technologies, for example, as PCR or sequencing primers, microarray elements, or 
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specific probes to identify, fully or partially complementary polynucleotides in a sample of nucleic acids. 
Methods of oligonucleotide selection are not limited to those described above, . 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence that 
is made by an artificial combinaUon of two or more otherwise separated segments of sequence. This 
5 artificial combinaUon is often accomplished by cheniical synthesis or, more cornmorJy, by the artificial 
manipulation of isolated segments of nucleic acids, e.g., by.genetic engineering techniques such as those 
described in Sambrook, surxa. The term recombinant includes nucleic acids that have been altered solely 
by addition, substitution, or deletion of a portion.of the nucleic acid Frequently, a recombinant nucleic 
acid may include a nucleic acid sequence operably linked to a promoter sequence. Such a recombinant 
10 nucleic acid may be part of a vector that is used, for example, to transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, eg., based on a 
^ vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is • 
expressed, inducing a protective immunological response in the mammal. - . 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
15 regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated region (UTRs). 
Regulatory elements interact with /host . or viral proteins which control transcripUon, translaUon, or RNA 
stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid; 
• amino acid, or amibody. Reporter molecules include radionuclides; enzvroesi'fluorescent. \ 
^ 20 chemiluminescent, or chromogenic agents; substrates; cofact OTS ; inhibitors; magneUc parUcles; and other 
moieties known in the art r 

An "RNA equivalent." in reference to a DNA sequence, is composed of the same linear sequence 
of nucleotides as the reference DNA sequence with the exception that all occurrences of the nitrogenous 
base thymine are replaced with uracil, and the sugar backbone is composed of ribose instead of 
25 deoxyribose. 

The term Sample" is used in its broadest sense. A sample suspected of containing nucleic acids 
encoding RECAP, or fragments thereof, or RECAP itself, may comprise a bodily fluid; an extract from a 
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA. RNA, or cDNA, in 
solution or bound to a substrate; a tissue; a tissue print; etc. 

30 The terms "specific binding" and "specifically binding" refer to that interaction between a protein 

or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or synthetic 
binding composition. The interaction is dependent upon the presence of a particular structure ofthe 
protein. e.g.. the antigenic determinant or epitope, recognized by the binding molecule. For example if 
an antibody is specific for epitope "A," the presence of a polypeptide comprising the epitope A, or the 

35 presence of free unlabeled A, in a reaction containing free labeled A and the antibody will reduce the 
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amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are removed 
from their natural environment and are isolated or separated, and are at least 60% free, preferably at least 
75% free, and most preferably at least 90% free from other components with which they are naturally 

5, associated. , 

\ t A "substitution" refers to the replacement of orie or more amino acid residues or nucleotides by; - 

different amino acid residues or nucleotides, respectively. 

: "Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, chips, 

slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, microparticles and 
10 capillaries. The substrate can have a variety of surface forms, such as wells, trenches, pins, channels and 

pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" refers to the collective pattern of gene expression by a particular cell type 

or tissue under given conditions at a given time. 

/ 'Transformation" describes a process by which exogenous DNA is introduced into a recipient 
15 cell. Transformation may occur under natural or artificial conditions according to various methods well 

known in the art,* arid may rely on any known method for the insertion of foreign nucleic acid: sequences : 

into a prokaryotie or eukaryotic host cell. The method for transformation is selected based on the type of 

host cell being transformed and may include, but is not limited to, bacteriophage or viral infection, ^ 
: electroporatioii, heat shock, lipofectipn, and particle bombardment. The term "transformed" cells : 
20 includes stably transformed cells in which the inserted DNA is capable of replication either as an 

autonomously replicating plasmid or as part of the host chromosome, as, well as transiently transformed 

cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to animals 

and plants, in which one or more of the cells of the organism contains heterologous nucleic acid 
25 introduced by way of human intervention, such as by transgenic techniques well known in the art. The 

nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 

by way of deliberate genetic manipulation, such as by microinjection or by infection with a 

recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in vitro 

fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The transgenic 
30 organisms contemplated in accordance with the present invention include bacteria, cyanobacteria, 

fungi, plants, and animals; The isolated DNA of the present invention can be introduced into the host 

by methods known in the art, for example infection, transaction, transformation or transconjugatioh. 

Techniques for transferring the DNA of the present invention into such organisms are widely known 

and provided in references such as Sambrook et al. (1989), supra. 
35 A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
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least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the 
nucleic acid sequences using blastn with the "BLAST 2 Sequences", tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or greater . 
5 sequence identity over a certain defined length. A variant may be described as, for example, an "allelic" 
(as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have significant 
identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides due 
to alternative splicing of exons during mRNA processing. The corresponding polypeptide may possess 
additional functional domains or lack domains that are present in the reference molecule. Species 
10 variants are polynucleotide sequences that vary from one species to another. The resulting polypeptides 
generally will have significant amino acid identity relative to each other. A polymorphic variant is a * 
variation in the polynucleotide sequence of a particular gene between individuals of a given species. 
Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in which the 
polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be indicative of, for 
15 example, a certain population, a disease state, or a propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 
least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the 
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 
20 60%, at least 70%, at least 80%, at least 90%, at least 95 %, or at least 98% or greater sequence identity 
over a certain defined length of one of the polypeptides. 
THE INVENTION 

The invention is based on the discovery of new human receptors and associated proteins 
(RECAP), the polynucleotides encoding RECAP, and the use of these compositions for the diagnosis, 

25 treatment, or prevention of neurological disorders; immunological disorders, including - 
autoimmune/inflammatory disorders; and cell proliferative disorders, including. cancer. 

Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding . 
RECAP. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide 
and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which 

30 nucleic acids encoding each RECAP were identified, and column 4 shows the cDNA libraries from which 
these clones were isolated. Column 5 shows Incyte clones and their corresponding cDNA libraries. 
Clones for which cDNA libraries are not indicated were derived from pooled cDNA libraries. In some 
cases, GenBank sequence identifiers are also shown in column 5 : The Incyte clones and GenBank cDNA 
sequences, where indicated, in column 5 were used to assemble the consensus nucleotide sequence of each 

35 RECAP and are useful as fragments in hybridization technologies. 
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, : : ; The columns of Table 2 show various properties of each of tie polypeptides of the invention: 
column 1 references the SEQ ID NO; column 2 shows the number of ^ arrnnb ad0 residues in each 
polypepUde; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylate 
\. sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
5 shows homologous sequences as identified by BLAST aiialysis along with relevant citations, all of which 
"are "expressly iricorpot 

and in some cases, searchable databases to which the analytical methods were applied The methods of 
column 7 were used to characterize each polypeptite ti^ 

^ The columns of Table 3 sIk>w tnctis'sWspecifici^ and diseasis/dlsdrders. or conditions 
10 associated with nucleqUde sequences encoding RECAP. The first column of Table 3 lists the nucleotide 
SEQ ID NOs. Column 2 lists fragments Of the nucleotide sequences of column 1. These fragments are 
useful, for example, in hybridization or amplification technologies to identify SEQ ID NO:23-44 and to 
distinguish between SEQ ID NO:23-44 and related polynucleotide sequences. The polypeptides 
encoded.by these fragments are useful, for example, as immunogenic peptides. Column 3 lists tissue 
15 categories which express RECAP, as a. fraction of total tissues expressing RECAP. Colurnn 4 lists 
diseases, disorders, or conditions associated with those tissues expressing RECAP as a fraction of total 
tissues expressing RECAP: Column 5 lists the vectors used to subclone each cDNA library. Of 
particular interest is the expression of SEQ ID NO:l l in hematopoietic/immune tissues and the 
expression of SEQ ID NO: 14 in reproductive tissues. 
20 • Thecolumrisof Table 4 show descriptions of the tissues used to construct the cDNA libraries 

from which cDN A clones encoding RECAP were isolated. Column 1 references the nucleotide SEQ ID 
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows the 
tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 

The invention also encompasses RECAP variants. A preferred RECAP variant is one which has 
25 at least about 80%, or alternatively at least about 90%, or even aileast about 95% amino acid sequence 
identity to the RECAP amino acid sequence, and which contains at least one functional or structural 

characteristic of RECAP. . 

The invention also encompasses polynucleotides which encode RECAP. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from 
30 the group consisting of SEQ ID NO:23-44, which encodes RECAP. The polynucleotide sequences of 
SEQ ID NO:23-44, as presented in the Sequence Listing, embrace the equivalent RN A sequences, 
wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone is 
composed of ribose instead of deoxyribose. . 

The invention also encompasses a variant of a polynucleotide sequence encoding RECAP. In ; 
35 particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least 
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about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence 
encoding RECAP. A particular aspect of the invention encompasses a variant of a polynucleotide . 
sequence comprising a sequence selected from the group consisting of SEQ ID NO:23-44 which has at 
least about 70%, or alternatively at least about 85%, or even at least about 95% polynucleotide sequence 
5 identity, to a nucleic acid sequence selected from the group consisting of SEQ ID NO:23-44. Any one of 
the polynucleotide variants described above can encode an amino acid sequence which contains at least 
one functional or structural characteristic of RECAP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic 
code, a multitude of polynucleotide sequences encoding RECAP, some bearing minimal similarity to the 

10 polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the -> - .■ 
invention contemplates each and every possible variation of polynucleotide sequence that could be made 
by selecting combinations based on possible codon choices. These combinations are madejn accordance 
with the standard triplet genetic code as applied to the polynucleotide sequence of naturally occurring 
RECAP, and all such variations are to be considered as being specifically disclosed. . 

15 Although nucleotide sequences which encode RECAP and its variants are generally capable of 

hybridizing to the nucleotide sequence of the naturally occurring RECAP under appropriately selected - 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding RECAP or its 
derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring . 
codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a 

20 particular prokaryotic or eukaryotic host in accordance with the frequency with which particular codons 
are utilized by the host Other reasons for substantially altering the nucleotide sequence encoding , 
RECAP and its derivatives without altering the encoded amino acid sequences include the production of 
RNA transcripts having more desirable properties, such as a greater half-life, than transcripts produced 
from the naturally occurring sequence. 

25 The invention also encompasses production of DNA sequences which encode RECAP and 

RECAP derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the synthetic 
sequence may be inserted into any of the many available expression vectors and cell systems using , 
reagents well known in the art. Moreover, synthetic chemistry may be used to introduce mutations into a 
sequence encoding RECAP or any fragment thereof. 

30 Also encompassed by the invention are polynucleotide sequences that are capable of.hybridizing 

to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID NO:23-44 and 
fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M.and S.L. Berger (1987) 
Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 152:507-51 1.) Hybridization 
conditions, including annealing and wash conditions, are described in "Definitions." _ . , 

35 Methods for DNA sequencing are well known in the art and may be used to practice any of the . 
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: embodiments of the invention; The methods may employ such enzymes as the Klenow fragment of DN A ^ 
polymerase I.SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (PE Biosystems. Foster 
City CA), thermostable T7 polymerase (Amersham Pharmacia Biotech. Piscataway KJ), or combinations 
of polymerases and proofreading exonucieases such as those found in the ELONG ASE amplification 
5 system (Life Technologies. Gaithcrsburg MD). Preferably, sequence preparation is automated with 
" machhWsuch' as" the MICROLAB-2200 liquid transfer system (Hamilton, Reno NV). PTC200JhermaU _ 
cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (PE Biosystems). ■ 
■^eiencingirthen carried out usin^ either uWabI 373 or 377 DNA sequencing system (PE 
Biosystems). the MEG AB ACE 1000 DNA sequencing system (Molecular Dynamics, Sunny vale CA), or 
10 other systems known in the art. Theresulting sequences are analyzed fusing a variety of algorithms which 
are well known in the art, (See, e.g.. Ausubel.F.M. (1997) Short Protocols in Molecular Biology , John 
Wiley & Sons, New York NY, unit 7"?; Meyers, R.A. (1995) Molecular Biology and Biotechnology . 
Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding RECAP may be extended utilizing a partial nucleotide 
15 sequence and employing various PCR-based methods known in the art to detect upstream sequences, such 
as promoters and regulatory elements. For example, one method which may be employed, restriction-site 
PCR, uses universal and nested primers to amplify unknown sequence from genomic DNA within a 
cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) Another method, 
inverse PCR. uses primers that extend in divergent directions to amplify unknown sequence from a 
20 circularized template. The template is derived from restriction fragments comprising a known genomic 
locus and surrounding sequences. (See, e.g.. Triglia. T. et al. (1988) Nucleic Acids Res. 16:8186.) A 
third method, capture PCR. involves PCR amplification of DNA fragments adjacent to known sequences 
inhuman and yeast artificial chromosome DNA. (See, eig., Lagerstrom, M: et aL (1991) PCR Methods 
Applic. 1 :1 1 1-1 19.) In this method, multiple restriction enzyme digestions and ligations may be used to 
25 "insert ah engineered double-stranded sequence into a region of unknown sequence before performing 
PCR. Other methods which may be used to retrieve unknown sequences are known in the art. (See. eg., 
Parker, J.D. et al. (1991) Nucleic Acids Res, 19:3055-3060). Additionally, one may use PCR, nested 
primers, and PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic DNA. This 
procedure avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR- 
30 basedmethods, primers may be designed using commercially available software, such as. OLIGO 4.06 
Primer Analysis software (National Biosciences, Plymouth MN) or another appropriate program, to be 
about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the 
template at temperatures of about 68°C to 72?C. . , ... . ... .' 

When screening for full-length cDNAs, it is preferable to use libraries that have been ; 
35 size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
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sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) library 
does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5' 
non-transcribed regulatory regions. . . . . . , 

Capillary electrophoresis systems which are commercially available may be used to analyze the 
5 size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, PE Biosystems), and the entire process 

10 from loading of samples to computer analysis and electronic data display may be computer controlled. 
Capillary electrophoresis is especially preferable for sequencing small DNA fragments which may be 
1 present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode RECAP may be cloned in recombinant DNA molecules that direct expression of RECAP, or 

15 fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of 
the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be produced and used to express RECAP. t 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter RECAP-encoding sequences for a variety of purposes including, but not 

20 limited to, modification of the cloning, processing, and/or expression of the gene product. DNA shuffling 
by random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be 
used to engineer the nucleotide sequences. For example, oligonucleotide-mediated site-directed 
mutagenesis may be used to introduce mutations that create new restriction sites, alter glycosylation 
patterns, change codon preference, produce splice variants, and so forth. 

25 The nucleotides of the present invention may be subjected to DNA shuffling techniques such as 

MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or improve 
the biological properties of RECAP, such as its biological or enzymatic activity or its ability to bind to 

30 other molecules or compounds. DNA shuffling is a process by which a library of gene variants is 
produced using PCR-mediated recombination of gene fragments. The library is then subjected to 
selection or screening procedures that identify those gene variants with the desired properties. These 
preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 
selection/screening. Thus, genetic diversity is created through "artificial" breeding and rapid molecular 

35 evolution. For example, fragments of a single gene containing random point mutations may be 
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rccombincd, screened; and then reshuffled until the desired properues are opurnized. Alternatively, . 
fragments of a giVen gene n^ 

family, either from the same or different species, thereby maximizing the genetic diversity of multiple 
naturally occurring genes in a directed and controllable manner. ■ { : i V ' V 

. ln another embodiment, sequences encoding RECAP may be synthesized, in whole or « part,. 

using Chernical meu,^ well Kno^ - 
■ Symp. Ser: -7:21*223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively. 

synthesis can be performed using various solution-phase or sbli^phase techniques. (See, e.g.. Creighton, 
10 T ri ^ 0 n , itrln- ^H^w,^1.r Properties. WH Freeman, New York NY, pp. 55-60; and 

Roberge, J.Y. et al. (1995) Science 269:202-204.). Automated synthesis may be achieved using the ABI 
431A peptide synthesizer (PE Biosystems). Additionally, the amino acid sequence of REGAP. or any 
• part thereof, maybe altered (luring direct synthesis and/or combined with sequences from other protems. 
or any part thereof, to produce a variant polypeptide or a polypeptide having a sequence of a naturally 

15 occurring polypeptide. > 
The peptide may be substantially purified by preparative ^high performance liquid ^ ^ ^ . 
chromatography. (See, e g., Chiez, R.M. and F.Z. Regnier (,990) Meunods Er^ymol. 182:392-121.) 
The composition^ the synthetic peptides may be confirmed by amino acid analysis or by sequenemg. 

^ (See, e.g.,' Creighton, supra, pp. 28-53.) ■ ; ; 

V In order to express a biologically active RECAP, the nucleotide sequences encoding RECAP, or 

derivatives thereof may be inserted into an appropriate expression vector, i.e.. a vector which contains the 
necessary elements for transcripdonal and translation* control of the inserted coding sequence in a . 
- suitable host. These element, include regulatory sequences, such as enhancers, constitutive and induable 
promoters, and 5' and 3' untranslated regions in ti^^ 
25 RECAP. Such elements may vary in their strength and specificity. Specific initiation signals may also 
be used to achieve more efficient translation of sequences encoding RECAP. Such signals include the 
ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where sequences 
'"' encoding RECAP and its initiation codon and upstream regulatory sequences are inserted into the . . 

appropriate expression vector, no additional transcriptional or translation* control signals rnay be 
30 needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous 
transiational control signals including an in-frame ATG initiation codon should be provided by the vector 
Exogenous translational elements' and initiation codons may be of various origins, both natural and. . 
synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for 
the particular host cell system used. (See. e.g.. Scharf. D. etal. (1994) Results Probl. Cel. Differ. 
• 35 20:125-162.) 
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. x , Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding RECAP and appropriate transcriptional and translational control 
elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo 
genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning. A Laboratory Manual . 
5 Cold Spring Harbor Press, Plainyiew NY, ch. 4, 8, and 16-17; Ausubcl, F.M. et al. (1995) Current 
Protocols in Molecular Biology , John Wilev & Sons. New York NY. ch. 9 r 13 r and 16 ) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding RECAP. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
10 yeast expression vectors; insect cell systems infected with viral expression vectors (e.g„ baculovims); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g. ( Ti or pBR322 plasmids); or 
animal cell systems. (See, e.g.. Sambrook. supra ; Ausubel. supra : Van Heeke. G. and S.M. Schuster 
, (1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; 

15 Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) Proc. Nad. Acad. ^ 
Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene/Ther. 7:1937-1945; Takamatsu, N. (1987) # 
EMBOJ. 6:307-31 1; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) Science ;} 
224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; The McGraw Hillf , f 
Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. and a 

20 T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655r3659; and Harrington, JJ. et al. (1997) Nat : f 

Genet. 15:345-355.) Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia | 
viruses, or from various bacterial plasmids, may be used for delivery of nucleotide sequences to the / B 

targeted organ, tissue, or cell population. (See, e.g., Di Nicola, M. et al. (1998) Cancer Gen. Then 
5(6):350-356; Yu, M. et al., (1993) Proc. Natl. Acad. Sci. USA 90(13):6340-6344; Buller, R.M. et al. 

25 (1985) Nature 317(6040):813-815; McGregor, D.P. et al. (1994) Mol. Immunol. 31(3):219-226; and 
Verma, I.M. and N. Somia (1997) Nature 389:239-242.) The invention is not limited by the host cell 
employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending upon 
the use intended for polynucleotide sequences encoding RECAP. For example, routine cloning, 

30 subcloning, and propagation of polynucleotide sequences encoding RECAP can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid 
(Life Technologies). Ligation of sequences encoding RECAP into the vector's multiple cloning site 
disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of transformed 
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro 

35 transcription, dideoxy sequencing, singie strand rescue with helper phage, and creation of nested deletions 
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inuteclonedse^ ; 
264 5503.5509,) When large quandties of RECAP S heeded, e.g. for theproducdon of anubodtes.- ^ 

vectors which directmghl^^^ 

strong, inducible T5 or T7bactcrk P tage proper nay be used. 

coniaWngco^udveorindud V- 

promoters. maybe ^itoyaiS t*"^*™^ "****^ ' 

vecto* direct eid*r the secretion or intracenular retenUon of jessed pr«e. K ,„d e raW e in,e g rauo n of 

10 supra; and Scorer, supra.) „ 
^ Pfant systems may also be used for expression of RECAP: Transcription of sequences er.od.ng : 
RECAP may be driven viral promoters. **. me 35S and 19S promoters of CaMV used atone or ,n 

Alternadvely.piamp^^^ 
15 use.. (See. el Coru^; Broglie. and T-e construe, can bemtrcduced 

into P .ant cells by direct DN A transformation or pathogen-mediated transection. (Se* e.g.. J* 
r ■ „„, v^w of T-hnolOEV (1992) McGraw Hill. New YorR NY. PP. 191-196.) . . 

,„ mammalian ce«s, a number of viral-based expression systems may be utill^d. In cases where 
ah adenovirus is used as an expression vector, sequences encoding RECAP may be ligated into an 
20 adenovirus uanscripd„n/.anslaUo„ complex consisting o, the late promoter and triparUte leader 
: sequence. Insertion in a non-essenUal El or E3 region ofthe viral genome may beused 

infective virus which expresses RECAP in host cells: (See, e.g., Logan, J. andT. Shenk (1984) Proc. 
Nad Acad : ScL US A 81:3655-3659.) In addition. uansaipUon enhancers, such as the Rous sarcoma 

25 vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to .leliver larger fragments of 
DN A than can be contained in and expressed front a plasmid. HACsof about 6 *b to 10 Mb are 
consumed and deiivered via convenuona. ddivcry methods (liposomes, potycationic amino polymers, or 
vesicles) for therapeutic purposes; (See. e.g.. Harrington, J.J. el al. (1997) N», Cen«. 15:345-355.) 

j„ F „Mon g termproducUonofrecombir^^ 

RECAP in cell lines is preferred. For example, sequences encoding RECAP can be transformed into cell 

expression elements and a selectable marker gene on the sameo, on a separa K vc« M . FoHowing *e 
reduction of the vector. ceilsmay be altowed to grow for about 1 to 2 days in enriched media before- 
,5 being switched to selective media. Thepurpose of d* selectable mar** is to confer resistance* a 
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selective agent, and its presence allows growth and recovery, of cells which successfully express the 
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These include, 
5 but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase 
genes, for use in tk and apf cells, respectively. (See, e.g., Wigler, M. et al, (1977) Cell 1 1 :223-232; 
Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers 
resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to chlorsulftiron 
10 and phosphinotricin acetyl transferase, respectively. (See, e.g., Wigler, M. et al. (1980) Proc. Natl. Acad 
Sci. USA 77:3567-3570; Colhere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) Additional 
selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements for 
metabolites. (See, eg., Hartman, S.C and R.C. Mulligan (1988) Proc. Natl, Acad. Sci. USA 
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), fl - 
15 glucuronidase and its substrate 6-glucuronide, or luciferase and its substrate luciferin may be used. - 
These markers can be used not only to identify transformants, but also to quantify the amount of transient 
or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C. A. (1995) 
Methods Mol. Biol. 55:121-131.) t , ^ 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
20 also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding RECAP is inserted within a marker gene sequence, transformed cells containing 
sequences encoding RECAP can be identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a sequence encoding RECAP under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 
25 expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding RECAP and that express 
RECAP may be identified by a variety of procedures known to those of skill in the art. These procedures 
include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, andprotein 
bioassay or immunoassay techniques which include membrane, solution, or chip based technologies for 
30 the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of RECAP using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include 
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence activated 
cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
35 reactive to two non-interfering epitopes on RECAP is preferred, but a competitive binding assay may be 
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employed. These and other, assays are well known in the art. (See, e.g., Hampton, R. et al. (1990) 
Serological Methods, a Laboratory Manual . APS Press, St. Paul MN, Sect. IV; Coligan, J.E. et al. 
f!997> Current Protocols in Immunology . Greene Pub. Associates and Wiley-Interscience, New York 
NY; and Pound, J.D. (1998Y Immunochemical Protocols . Humana Press, Totowa NJ.) 

5 ' A wide variety of labels and conjugation techniques are known by those skilled in the art and 

- may be used in various nucleic acid and amino acid assays^ M 

or PCR probes for detecting sequences related to polynucleotides encoding RECAP include oligolabeling, 
nick translation, end-labeling", or PCR amplification using a labeled nucleotide. Alternatively, the 
sequences encoding RECAP, or any fragments thereof, may be cloned into a vector for the production of 

10 an mRNA probe. Such vectors are known in the art, are commercially available, and may be used to 
synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of commercially available 
kits, such as those provided by Amershani Pharmacia Biotech, Promega (Madison WI), and US 
Biochemical. Suitable reporter molecules or labels which may be used for ease; of detection include 

15 radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, 
cofactors, inhibitors, magnetic particles, and the like. - 

Host cells transformed with nucleotide sequences encoding RECAP may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture! The protein produced 
by a transformed cell may be secreted or retained intracellular^ depending on the sequence and/or the ' 

20 vector used. As will be understood by those of skill in the art, expression vectors containing • 
polynucleotides which encode RECAP may be designedto contain signal sequences which direct 
secretion of RECAP through a prokaryotic or eukaryotic cell membrane. 

/ In addition,- a host cell strain may be chosen for its ability to modulate expression of the inserted 
sequences or to process the expressed protein in the desired fashion. Such modifications of the 

25 polypeptide include, but are not limited to, acetylatioh, carboxylation, glycosylation, phosphorylation, 
lipidation, and acylatiori. Post-traaslatipnal processing which cleaves a "prepro" or "pro" form of the 
protein may also be used to specify protein targeting, folding, and/or activity. Different host cells which 
have specific cellular machinery and characteristic mechanisms for post-translational activities (e.g., 
CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture Collection 

30 (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing of the 
foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid sequences 
encoding RECAP may be ligated to a heterologous sequence resulting in translation of a fusion protein in 
any of the aforementioned host systems. For example, a chimeric RECAP protein containing a, . 
35 heterologous moiety that can be recognized by a commercially available antibody may facilitate the 
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screening of peptide libraries for inhibitors of RECAP activity. Heterologous protein and peptide 
moieties may also facilitate purification of fusion proteins using commercially available affinity matrices. 
Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose binding. protein 
(MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and hemagglutinin 
5 (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion proteins on 

immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, respectively. 
FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion proteins using 
commercially available monoclonal and polyclonal antibodies that specifically recognize these epitope 
tags. A fusion protein may also be engineered to contain a proteolytic cleavage sitelocated between the 
10 RECAP encoding sequence and the heterologous protein sequence, so that RECAP may be cleaved away 
from the heterologous moiety following purification. Methods for fusion protein expression and 
purification are discussed in Ausubel (1995, supra , ch. 10). A variety of commercially available kits 
may also be used to facilitate expression and purification of fusion proteins. '■ ? / 

In a further embodiment of the invention, synthesis of radiolabeled RECAP may be achieved in 
15 vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems' 
couple transcription and translation of protein-coding sequences operably associated, with the T7, T3, or 
SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 
example, 35 S-methionine. ■ _ , : 

RECAP of the present invention or fragments thereof may be used to screen for compounds - 
20 that specifically bind to RECAP. At least one and up to a plurality of test compounds may be T screened 
for specific binding to RECAP. Examples of test compounds include antibodies, oligonucleotides, 
proteins (e.g., receptors), or small molecules. . - 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
RECAP, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
25 natural binding partner. (See, Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which RECAP 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the * 
compound can be rationally designed using known techniques. In one embodiment, screening for these 
compounds involves producing appropriate cells which express RECAP, either as a secreted protein or 
30 on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila . or E. coli . Cells 
expressing RECAP or cell membrane fractions which contain RECAP are then contacted with a test 
compound and binding, stimulation, or inhibition of activity of either RECAP or the compound is 
analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
35 detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, the 
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assay may comprise the steps of combining at least one test compound with RECAP, either in solution.. 
•' or affixed to a solid support, and detecting the binding of RECAP to the compound. Alternatively, the 
assay may detect or measure binding of a test compound in the presence of a labeled competitor. ; 
Additionally, the assay may be carried out using cell^free preparaUohs, chemical libraries, or natural 
5 product mixtures, and the test compound(s) may be free in solution or affixed to a solid support. 
- " - " RECAP of the present invention or-fragments thereof may be.used to screen for compounds 
that modulate the activity of RECAP. Such compounds may include agonists, antagonists, or partial or 
inverse agonists. In one embodiment, an assay is performed under conditions permissive for RECAP • 
''activity, wherein RECAP is combined with at least one test compound, and the activity of RECAP in 
10 the presence of a test compound is compared with the activity of RECAP in the absence of the test 
compound. A change in the activity of RECAP in the presence of the test compound is indicative of a 
compound that modulates the activity of RECAP. Alternatively, a test compound is combined with an 
in vitro or cell-free system comprising RECAP under conditions suitable for RECAP activity; and the 
assay is performed. In either of these assays, a test compound which modulates the activity of RECAP 
15 may do so indirectly and need not come in direct contact with the test compound. At least one and up 
to a plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding RECAP or their mammalian homologs may 
be "knocked out" in an animal model system using homologous recombination in embryonic stem (ES) 
cells. Such techniques are well known in the art and are useful for the generation of animal models of 
20 human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767.337.) For example, 
mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse embryo and 
grown in culture. The ES cells are transformed with a vector containing the gene of interest disrupted 
by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M R. (1989) Science 
244:1288-1292). The vector integrates into me corresponding region of the host genome by 
25 homologous recombination. Alternatively, homologous recombination takes place using the Cre-loxP 
system to knockout a gene of interest in a tissue- or developmental stage-specific manner (Marth, J.D. 
(1996) Clin Invest. 97:1999-2002; Wagner, K.U. et al . (1997) Nucleic Acids Res. 25:4323-4330). 
Transformed ES cells are identified and miCroinjccted into mouse cell blastocysts such as those from 
the C57BL/6 mouse strain. The blastocysts are surgically transferred to pseudopregnant dams, and the 
30 resulting chimeric progeny are genotyped and bred to produce heterozygous or homozygous strains. 
Transgenic animals thus generated may be tested with potential therapeutic or toxic agents. 

Polynucleotides encoding RECAP may also be manipulated iojjtro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
35 into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 

''35 : 
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(1998) Science 282:1 145-1 147). -.. . . , . ... ... . 

Polynucleotides encoding RECAP can also be used to create "knockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region 
of a polynucleotide encoding RECAP. is. injected into animal ES cells, and the injected sequence . 
5 integrates into the animal cell genome. Transformed cells are injected into blastulae, and the blastulae 
are implanted as described above. Transgenic progeny. or inbred lines are: studied and treated with 
potential pharmaceutical agents to obtain information on treatment of a human disease. Alternatively, a 
mammal inbred to oyerexpress RECAP, e.g., by secreting RECAP in its milk, may also serve as a 
convenient source of that protein (Janne, J. et al. (1998) Biotechnot. Annu. Rev. 4:55-74). 
10 THERAPEUTICS 

_ Chemical and structural similarity, e.g., in the context of sequences and motifs, exists between 
regions of RECAP and receptors and associated proteins. . In addition, the expression of RECAP is 
closely associated with cell proliferation, cancer, inflammation/trauma, and with neurological disorders. 
Therefore, RECAP appears to play a role in neurological disorders; immunological disorders, including 

15 autoimmune/inflammatory disorders; and cell proliferative disorders, including cancer. In the treatment 
of disorders associated with increased RECAP expression or activity, it is desirable to decrease the 
expression or activity of RECAP. In the treatment of disorders associated with decreased RECAP 
expressions, activity, it is desirable to increase the expression or activity of RECAP. a 
. Therefore, in one embodiment, RECAP or a fragment or derivative thereof may be 

20 administered to a subject to treat or prevent a disorder associated with decreased expression or.actiyity 
of RECAP. Examples of such disorders include, but are not limited to, a neurological disorder such as 
epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's 
disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
Down's syndrome, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural 

25 muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative 
intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system disease; prion 
diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome; 
fatal familial insomnia, nutritional and nietabolic diseases of the nervous system, neurofibromatosis, 

30 tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental 
retardation and other developmental disorders of the central nervous system, cerebral palsy, 
neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 
dermatomyositis and polymyositis; inherited, metabolic, endocrine, and toxic myopathies; myasthenia 

35 gravis, periodic paralysis; mental disorders including mood, anxiety, and schizophrenic disorders;. 
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seasonal affective disorder (SAD); akathesia, amnesia, catatonia, diabetic neuropathy, tardive 
' dyskinesia; dystonias, paranoid psychoses; postherpetic neuralgia, Tourer's (disorder, progressive , 
•■ supranuclear palsy, corticobasal degeneration, and familial frontotcmporal dementia; ah immunological 
disorder, including autoimmune/inflammatory disorders, such as acquired immunodeficiency syndrome 
5 (AIDS), X-linked agammagiobinemia of Bniton, common variable immunodeficiency (CVI), 

DiGeorge's syndrome (thymic hypoplasia) r thymic dysplasia, isolated Jg\S^;^^_ 
' immunodeficiency disease (SCID), immunodeficiency ^ 
> Alclrich syndrome), Chediak-Higashi syndrome, cr^oiucg 

angioneurotic edema, and immunodeficiency associated with ^ Cushing's disease, Addison's disease. 
10 adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyehdocrinopamy-can^^ 

dermatitis. Crohn ; s disease, atopic dermatitis, dennatomyositis, diabetes mellitus. emphysema, episodic 
lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 

15 glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease* Hashimoto's thyroiditis, : 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or . 
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's 
syndrome, rheumatoid arthritis, scleroderma. Sjbgren's syndrome, systeimc anaphylaxi 
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis; Werner 

20 syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 
rurigsfl.parasiti^potozoal, and Wtniiitthic infections, hcrnatopoieticcancers, including lymphprna. 
leukemia, and myeloma, and trauma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease (MCTD), 
myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary : 

25 thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, 
sarcoma, teratocarcinoma. and, in particular, cancers of the adrenal gland, bladder, bone, bone marrow, 
brain, breast, cervix, gall bladder, ganglia, gasuoihtestinal ijact, heart, kidney, liver, lung, m 
ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and 



uterus: 



30 > In another embodiment, a vector capable of expressing RECAP or a fragment or derivative ; 
thereof may be administered to a subject to treat or' prevent a disorder associated with decreased; 
: expression or activity of RECAP including, but not limited to, those described above. ^ , 

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
RECAP in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or 
35 prevent a disorder associated with decreased expression or activity of RECAP including, but- not limited 
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to, those provided above. . 

In still another embodiment, an agonist which modulates the activity of RECAP may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or activity 
of RECAP including, but not limited to, those listed above. . 
5 In a further embodiment, an antagonist of RECAP may be administered to a subject to treat or 

prevent a disorder associated with increased expression or activity of RECAP. Examples of such 
disorders include, but are not limited to, those neurological disorders; immunological disorders, including 
autoimmune/inflammatory disorders; and cell proliferative disorders, including cancer, described above. 
In one aspect, an antibody which specifically.binds RECAP may be used directly as an antagonist or 
10 indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissues , 
which express RECAP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide encoding 
RECAP may be administered to a subject to treat or prevent a disorder associated with increased . 
expression or activity of RECAP including, but not limited to, those described above. 

15 In other embodiments, any of the proteias, antagonists, antibodies,agonists, complementary 

sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by 
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination of 
therapeutic agents may act synergistically to effect the treatment or prevention of the various disorders 

20 described above. Using this approach, one may be able to achieve therapeutic efficacy with lower 
dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of RECAP may be produced using methods which are generally known in the art. 
In particular, purified RECAP may be used to produce antibodies or to screen libraries of pharmaceutical 
agents to identify those which specifically bind RECAP. Antibodies to RECAP may also be generated 

25 using methods that are well known in the art. Such antibodies may include, but are not limited to, 

polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced by 
a Fab expression library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are generally 
preferred for therapeutic use. , . • * - ■* \ 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, and 

30 others may be immunized by injection with RECAP or with any fragment or oligopeptide thereof which 
has immunogenic properties. Depending on the host species, various adjuvants may be used to increase 
immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels such as 
aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, polyanions, 
peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG (bacilli „ 
.Calmctte-Guerin^ and Corvnebacterium narvum are especially preferable. * -\ 
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- '"'.It is preferred that iheoUgopeptides, peptides, or fragments used to induce antibodicslo ; RECAP \. 
have an amino add sequence consisting of at least about 5 amino acids, and generally will consist of at 
least about 10 amino acids. It is also preferable that these oligopep 
• ' . identical to a portionof the amino acid sequence 6t the.natural protein." Short stretches of RECAP amino 
5 acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric molecule 
" " may be produced. z " ~; : ~~ " ''' - - - - - . _ . . , vr . 

• • Monoclonal anUbodies to RECAP may be prepared using any technique which provides for the 

production of antibody molecules by continuous cell lines in culture! These include, but are not limited 
to, the hybridbma technique, the human B-cell hybridoma technique, and the EB V-hybridoma technique. 
10 (See, eg., Kohler, G et al . (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol. Methods 
81:31-42; Cote, R.J. et al.. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and Cole, S.P. et al. (1984) 

Mol. Cell Biol. 62:109-120.) .-i^ 

In addition, techniques developed for the production of "chimeric anUbodies/* such as the . 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate antigen 
15 specificity and biological activity, can be used. (See, e.g„ Morrison, S.L. et al. (1984) Proc. Natl. Acad. 
Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, S. et al. 
(1985) Nature 314:452-454.) Alternatively, techniques described for the production of single chain 
antibodies may be adapted, using methods known in the art, to produce RECAP-specific single chain 
• antibodies/ Antibodies with related specificity, but of distinct idiotypic composition, may be generated by 
20 chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton, D.R. (1991) 
Proc. Natl. Acad. Sci. USA 88:10134-10137.) : 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population or 
by screening immunoglobulin libraric» :or panels of highly specific binding reagents as disclosed in the 
literature. (See, e.g., Ortandi, R. et al. (1989) Proc.. Natl. Acad: Sci. USA 86:3833^3837; Winter, G^et 
25 al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for RECAP may also be generated. For 
example, such fragments include, but are not limited to, F(ab') 2 fragments produced by pepsin digestion 
of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the F(ab , )2 
fragments! Alternatively, Fab expression libraries may be constructed to allow rapid and easy 
30 identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. et al. 
(1989) Science 246:1275-1281.) \ 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or iirimunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in 
35 immunoassays typically involve the measurement of complex formation between RECAP and its specific 
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antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering RECAP epitopes is generally used, but a competitive binding assay may also be employed 
(Pound, supra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 
5 may be used to assess the affinity of antibodies for RECAP. Affinity is expressed as an association 
constant, K,, which is defined as the molar concentration of RECAP-antibody complex divided by the 
molar concentrations of free antigen and free antibody under equilibrium conditions. The K, determined 
for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple 
RECAP epitopes, represents the average affinity, or avidity, of the antibodies for RECAP. The K, 
10 determined for a preparation of monoclonal antibodies, which are monospecific for a particular RECAP 
epitope, represents a true measure of affinity. High-affinity antibody preparations with K, ranging from 
about 10 9 to 10 12 L/mole are preferred for use in immunoassays in which the RECAP-antibody complex 
must withstand rigorous manipulations. Low-affinity antibody preparations with ranging from about 
10 6 to 10 7 L/mole are preferred for use in immunopurification and similar procedures which ultimately 
15 require dissociation of RECAP, preferably in active form, from the antibody (Catty, D. (1 988) 

Antibodies. Volume I: A Practical Approach . IRL Press, Washington DC; Liddell, J.E. and A. Cryer 
(1991) A Practical Guide to Monoclonal Antibodies . John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 
the quality and suitability of such preparations for certain downstream applications. For example, a 
20 polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of RECAP-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for antibody 
quality and usage in various applications, are generally available. (See, e.g., Catty, supra , and Coligan et 
al.. supra .) 

25 In another embodiment of the invention, the polynucleotides encoding RECAP, or any fragment 

or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, RNA, 
PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding RECAP. 
Such technology is well known in the art, and antisense oligonucleotides or larger fragments can be 

30 designed from various locations along the coding or control regions of sequences encoding RECAP. 
(See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics . Humana Press Inc., Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellular^ in the form of an expression plasmid which, upon transcription, produces a sequence 

35 complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
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Slater, J.E. ct al..(1998) J. Allergy Ciiii. Immunol. 1 02(3) :4 69^75; and Scanlon, K.J. et al. (1995) 
9(13):1288-1296.) Antisense sequences can also be introduced intracellular^ through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A D. (1990) Blood 
r 76:271; Ausubel, supra : Uckert,; W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
5 gene deli very mechanisms include liposome-derived systems, artificial viral envelopes, and other 

- systems kiiownan the a^ et al. t . 

(1998) J. Pharm. Sci. 87(1 1):1308-1315; and Morris, M.C. et aK (1997) Nucleic, Acids Res. 
25(14):2730-2736.) v ,.;; ; : -- : : .■ r\-/ - - ' ; ; - :/ " - v : ' \ \ -~- - 

In another embodiment of the invention, polynucleotides encoding RECAP may be used for 

10 somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g M in the cases of severe combined immunodeficiency (SCID)-Xl disease characterized by X-linked 
. inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency (Blaese, 
R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), cystic 

15 fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal; R.G. et al. (1995) Hum. Gene Therapy 6:643- 
666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703),' thalassamias, familial 
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal, R.G. 
(1995) Science 270:404-410; Verrrta, I.M; and Somia, N. (1997) Nature 389:239-242)), (ii) express a 
conditionally lethal gene product (e.g., in the case of cancers which result from unregulated cell .'.=._. 

20 proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g„ 
against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) Nature 
335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA. 93:11395-11399), hepatitis B or C 
virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides brasiliensis : and 
protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi) . In the case where a 

25 genetic deficiency in RECAP expression or regulation causes disease, the expression of RECAP from an 
appropriate population of transduced cells may alleviate the clinical manifestations caused by the genetic 
deficiency. ; 

; In a further embodiment of the invention, diseases or disorders caused by deficiencies in RECAP 
are treated by constructing mammalian expression vectors encoding RECAP and introducing these 

30 vectors by mechanical means into RECAP-deficient cells. Mechanical transfer technologies for use with 
cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic gold 
particle delivery, (iii) liposome-mediated trarisfection, (iv) receptor-mediated gene transfer, and (v) the 
use of DNA trarisposons (Morgan, R.A. and W.F. Anderson (1993) Aiinu. Rev. Biochem. 62:191-217; 
Ivies, Z, (1997) Cell 91:501-510; Bdulay, J-L. and H. R6cipon (1998) Curr. Opin. Biotechnol. 9:445- 

35 450). > • : \- 7- '.' ^-"f V;' :; . 
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: Expression vectors that. may be effective for the expression of RECAPinclude, but are not 
limited to, the PCDNA 3.1. EPITAG. PRCCMV2, PREP, PVAX vectors (Invitrogen. Carlsbad CA), 
PCM V-SCRIPT, PCM V-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, PTET-ON, 
PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto. CA). RECAP may be expressed using (i) a 
5 comtitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous sarcoma virus (RSV), SV40 
virus, thymidine kinase (TK), or p-actin genes), (ii) an inducible promoter (e.g., the tetracycline-regulated 
promoter (Gossen, M. and H. Bujard (1 992) Proc.*Natl. Acad. Sci. USA 89:5547-555 1 ; Gossen, M. et 
al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau (1998) Curr. Opia Biotechnol. 9:451 - 
456), commercially available in the T-REX plasmid (Invitrogen)); the ecdysone-inducible promoter 
10 .(availabIeintheplasmi^PVGIOCRandPIND;Invitrogen);theFK506/rapamycinindu 

or the RU486/mifepristone inducible promoter (Rossi, F.M.V. and H.M. Blau. supra) ), or (iii) a tissue- 
specific promoter or the native promoter of the endogenous gene encoding RECAP from a normal * 
individual. . - 

Commercially available liposome transformation kits (e!g., the PERFECT LIPID 
15 TRANSACTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental parameters. 
In the alternative, transformation is performed using the calcium phosphate method (Graham, F.L and 
A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. (1982) EMBO'J. - * 
1 :84 1-845). The introduction of DNA to primary cells requires modification of these standardized 
20 mammalian transfection protocols. . . . , . 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to RECAP expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding RECAP under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
25 element (RRE) along with additional retrovirus exacting RNA sequences and coding sequences required 
for efficient vector propagation. Retrovirus vectors (e g., PFB and PFBNEO) are commercially available 
(Stratagene) and are based on published data (Riviere. I. et al. ( 1 995) Proc. Natl. Acad. Sci. USA 
92:6733-6737). incorporated by reference herein. The vector is propagated in an appropriate vector 
producing cell line (VPCL) that expresses an envelope gene with, a tropism for receptors on the target 
30 cells or a promiscuous envelope protein such as VS Vg (Armentano. D. et al. (1987) J. Virol. 61 :1647- 
1650; Bender. M A. et al. (1987) J. Virol. 61:1639-1646; Adam. M.A. and A.D. Miller (1988) J. Virol. 
62:3802-3806; Dull. T. et al. (1998) J. Virol. 72:8463-8471; Zufferey. R. et al. (1998) J. Virol. 72:9873- 
9880). U.S. Patent Number 5,910.434 to Rigg ("Method for obtaining retrovirus packaging cell lines 
producing high transducing efficiency retroviral supernatant") discloses a method for obtaining retrovirus 
35 packaging cell lines and is hereby incorporated by reference. Propagation of retrovirus vectors. ■ - 
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transduction of a population of cells (e.g., CD4* T-cells), and the return of transduced cells to a patient 
are procedures well known to persons skilled in the art of gene therapy and have been well documented 
(Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, 
M.L. (1997) J. Virol. 71:4707-4716; Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; 
5 Su, L. (1997) Blood 89:2283-2290). 
;* " " Inthe alternative, anadehbvirus-based gene therapy delivery system is used to deliver - ~ '* 
polynucleotides encoding RECAP to cells which have one or more genetic abnormalities with respect to 
the expression of RECAP. The construction and packaging of adenovirus-based vectors are well known 
to those with ordinary skill in the art. Replication defectivie adenovirus vectors have proven to be 

10 versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas (Csete, 
M.E. etal. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are described in 
U.S. Patent Number 5,707,61 8 to Arnientano ("Adenovirus vectors for gene therapy"), hereby 
incorporated by reference. For adenoviral vectors, see also Antinozzi, P. A. et al. (1999) Annu. Rev. 
Nutr. 19:511-544; and Verma, LM. and N. Somia (1997) Nature 18:389:239-242, both incorporated by 

15 reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding RECAP to target cells which have one or more genetic abnormalities with 
respect to the expression of RECAP. The use of herpes simplex virus (HS V)-based vectors may be 
especially valuable for introducing RECAP to cells of the central nervous system, for which HSV has a 

20 tropism. The construction and packaging of herpes-based vectors are well known to those with ordinary 
skill in the art. A replication-competent herpes simplex virus (HSV) type 1 -based vector has been used 
to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res. 169:385-395). The 
construction of a HSV-1 virus vector has also been disclosed in detail in U.S. Patent Number 5,804,413 
to DeLuca ("Herpes simplex virus strains for gene transfer"), which is hereby incorporated by reference. 

25 U.S. Patent Number 5,804,41 3 teaches the use of recombinant HS V d92 which consists of a genome 
containing at least one exogenous gene to be transferred to a cell under the control of the appropriate 
promoter for purposes including human gene therapy. Also taught by this patent are the construction and 
use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. For HSV vectors, see also Coins, 
W.F. et al. (1999) J. Virol. 73:5 19-532 and Xu, H. et al. (1994) Dev. Biol. 163:152-161, hereby 

30 incorporated by reference. The manipulation of cloned herpesvirus sequences, the generation of 

recombinant virus following the transfection of multiple plasmids containing different segments of the 
large herpesvirus genomes, the growth" and propagation of herpesvirus, and the infection of cells with 
herpesvirus are techniques well known to those of ordinary skill in the art. : 
In another alternative, an alpha virus (positive, single-stranded RNA virus) vector is used to 

35 deliver polynucleotides encoding RECAP to target cells. The biology of the prototypic alphavirus, 
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Semliki Forest Virus (SFV). Has been studied extensively and gene transfer vectors have been based on 
the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotech. 9:464-469). During alphavirus 
RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid proteins. This 
subgenomic RNA replicates to higher levels than the full-length genomic RNA, resulting in the 
5 overproduction of capsid proteins relative to the viral proteins with enzymatic activity (e.g., protease and 
polymerase). Similarly, inserting the coding sequence for, RECAP into the alphavirus genome in place of 
the capsid-coding region results in the production of a large number of RECAP-cpding RNAs and the 
synthesis of high levels of RECAP in vector transduced cells. .While alphavirus infection is typically 
. associated with cell lysis within a few days, the ability to establish a persistent infection in hamster 

10 normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that the lytic replication of 
alphaviruses can be altered to suit the needs of the gene therapy application (Dryga, S. A. et al. (1997) 
Virology 228:74-83). The wide host range of alphaviruses will allow the introduction of RECAP into a 
variety of cell types. The specific transduction of a subset of cells in a population may require the sorting 
of cells prior to transduction. The methods of manipulating infectious cDNA clones of alphaviruses, 

15 performing alphavirus cDNA and RNA transfections, and performing alphavirus infections, are well - - 
known to those with ordinary skill in the art. .... .• . ' " .... 

Oligonucleotides derived from the transcription initiation site^e.g., between about positions -10 
and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can be 
achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes 

20 inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, ^ 

transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have been 
described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, Molecular and 
Immunoloeic Approaches . Futura Publishing, Mt. Kisco NY, pp. 163-177.) A complementary sequence 
or antisense molecule may also be designed to block translation of mRNA by preventing the transcript 

25 from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of thexibozyme . 
molecule to complementary target RNA, followed by endonucleolytic cleavage. - For example, engineered 
hammerhead motif ribozyme molecules may specifically and efficiently catalyze endonucleolytic cleavage 

30 of sequences encoding RECAP. ' ..... . .. # 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by. 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, GUU, 
and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, corresponding to 
the region of the target gene containing the cleavage site, may be evaluated for secondary structural 

35 features which may render the oligonucleotide inoperable. The suitability of candidate targets may also - 
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be evaluatcd by tesUrig acc^sibility to hybridizaUon with. complementary oligonucleotides using 

ribonuclease protection assays. 
- ■ , - V' : Complementary ribonucleic acid molecules and ribozymes of toe invention may be prepared by 

any method knoWn in the art for the s^^^^ 
5 chemically synthesizing oligonucleoudes such as solid phase phosphpramidite chemical synthesis. 
- ~ ^jef haUvelyr RNArrholecules-may be-geherated by in vitro and in vivo transcription of DNA sequences . ; 
encoding REC AP. Such DNA sequences may be incorporated into a Wide variety of vectors with : 
suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 
synthesize complementary RNA, constituti vely or inducibly. can be introduced into cell lines, cells, or 

10 tissues. " .. - ■ .' 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to. the addition of flanking sequences at the 5* and/or 3' ends of 
'■ the molecule, of the use of phosphorothioate or 2'O-methyl rather than phosphodiesterase linkages within 
the backbone of the molecule. This concept is inherent in the production of PNAs and can be extended in 
15 all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and wybutosine, 
as well as acetyl-, methyl-, thio-, -and similarly modified forms of adenine, cytidine, guanine, thymine, and 
uridine which are not as easily recogiiized by endogenous endonucleases. . ' 

An Additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding RECAP. Compounds 
. 20 which may be effective in altering expression of a specific polynucleotide may include, but are not 
v limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, , 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular chemical 
entities which are capable of interacting with specific polynucleotide sequences. Effective compounds^ 
may alter polynucleotide expression by acting as either inhibitors of promoters of polynucleotide : 
25 ^ expression. Thus, in the treatment of disorders associated with increased RECAP expression or 
activity, a compound which specifically inhibits expression of the polynucleotide encoding RECAP 
may be therapeutically useful, and in the treament of disorders associated with decreased RECAP 
' •'•.'.expression or activity; a compound- which specifically promotes expression of the polynucleotide 

encoding RECAP may be therapeutically useful. 
30 : At least one, and up to a plurality, of test compounds may be screened for effectiveness in 

altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, cbmmercially-avail able or proprietary 
library, of naturally-occurring or non-natural chemical compounds; rational, design of a compound based 
35 on chemical and/or structural properties of the target polynucleotide; and selection from a library of 
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chemical compounds created combinatorially or randomly. A sample comprising a polynucleotide 
encoding RECAP is exposed to at least one test compound thus obtained. The sample may comprise, 
for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted biochemical system. 
Alterations in the expression of a polynucleotide encoding RECAP are assayed by any.methqd 
5 commonly known in the art. Typically, the expression of a specific nucleotide is detected by 
hybridization with a probe having a nucleotide sequence complementary to the sequence of the 
polynucleotide encoding RECAP. The amount of hybridization may be quantified, thus forming the 
basis for a comparison of the expression of the polynucleotide both with and without exposure to one 
or more test compounds. Detection of a change in the expression of a polynucleotide exposed to a test 

10 compound indicates that the test compound is effective in altering the expression of the polynucleotide. 
A screen for a compound effective in altering expression of a specific polynucleotide can be carried 
out, for example, using a Schizosaccharomvces pombe gene expression system (Atkins, D. et al. (1999) 
U.S. Patent No. 5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids Res. 28:E15) or a human cell line 
such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. Commun. 268:8-13). A 

15 particular embodiment of the present invention involves screening a combinatorial library of 

oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide nucleic acids, and modified 
oligonucleotides) for antisense activity against a specific polynucleotide sequence (Bruice, T.W. etal. 
(1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent No. 6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable for 

20 use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells taken 
from the patient and clonally propagated for autologous transplant back into that same patient. Delivery 
by transfection, by liposome injectioas, or by polycationic amino polymers may be achieved using 
methods which are well known in the art (See, e.g., Goldman, C.K. et al. (1997) Nat. Biotechnol. 
15:462-466.) 

25 Any of the therapeutic methods described above may be applied to any subject in need of such 

therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a pharmaceutical 
composition which generally comprises an active ingredient formulated with a pharmaceutical^ 
30 acceptable excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and 

proteins. Various formulations are commonly known and are thoroughly discussed in the latest edition of 
Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). Such pharmaceutical 
compositions may consist of RECAP, antibodies to RECAP, and mimetics, agonists, antagonists, or 
inhibitors of RECAP. 

35 The pharmaceutical compositions utilized in this invention may be administered by any number 
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of routes including. 1>ut nbt^itiiiedto-oral; intravenous, intramuscular. intra-artcriaViiuramedullary, 
intrathecal, intraventricular, pulmonary, uansdermal, subcutaneous, intraperitoneal, intranasal, enteral, 
topical, sublingual, or rectal means. 

' Pharimceuti^ pulmonary administration may be' prepared- in liquid or dry . 

5 - powder' form. • mse.cbmpositionS arc generally aerosolized 'immediately prior to inhalation by the • 
- patient. in Wcase of small molecule 

delivery of fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. larger 
' peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar region of. 

the lung have enabled the practical delivery of drugs such as insulin to blood circulation (see. e.g., Patton, 
10 J.S. et al., U.S. Patent No. 5;997.848). Pulmonary delivery has the advantage of administration without 
needle injection, and obviates the need for r^tenUally toxic penetration eiihancers. 

PharmaceuUcal composiUons suitable for use in the invention include compositions wherein the 
"active ingredients are contained in an effective amount'to acWeve.the intended'pu'rpose. ^ The 
determination of an effective dose is well within the capability of those skilled in the art. 
15 Specialized forms of pharmaceutical compositions may be prepared for direct intracellular ■ 

delivery of macromolecules comprising RECAP or fragments thereof. For example, liposome 
preparations-containing a ce^n.permeable^c^ecuie may promote cen.fi.sidn and intracellular . . 
. dehv^ Alternatively, RECAPor a fragment thereof may be joined to a short 

catiohic N-terrninal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found 
20 to transduce into the cells of all tissues, including the brain, in.a mouse model system (Schwarze, S.R. et 
: al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e g., of neoplastic cells, or in animal models such as mice. rats, rabbits, dogs, monkeys, or 
pigs. An animal model may also be used to determine the appropriate concentration range and route of 
25 administration. :' Sucn information.^^ 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example RECAP 
or fragments thereof, antibodies of RECAP, and agonists, antagonists or inhibitors of RECAP, which 
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by standard 
30 pharmaceutical procedures in cell cultures^ with experimental animals, such as by calculating the ED S0 
(the dose therapeutically effective in 50%, of depopulation) or LD 50 (me dose ^ 
population) statistics. The dose ratio of toxic to therapeutic effects is the therapeutic index, which can be 
expressed as the LD 50 /ED 50 ratio. PrWrnafceuticarcOmpositions which exhibit large therapeutic indices 
. are preferred. The data obtained from cell culture assays and ar^^ 
35 range of dosage for human use. The dosage contained in such compositions is preferably within a range 
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of circulating concentrations that includes the ED 50 with little or no toxicity. The dosage varies within 
this range depending upon the dosage form employed, the sensitivity of the patient, and the route of 
administration. 

- The exact dosage will be determined by the practitioner, in light of factors related to the subject 
5 requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active - 
moiety or to maintain the desired effect. Factors which may be taken into account include the.severity of 
. the disease state, the general health of the subject, the age, weight, and gender of the subject, time and 
frequency of administration, drug combination(s), reaction sensitivities, and response to therapy. Long- 
. acting pharmaceutical compositions may be administered every 3 to 4 days, every week, or biweekly 
10 depending on the half-life and clearance rate of the particular formulatioa 

Normal dosage amounts may vary from about 0.1 ^g to 100,000 fig, up to a total dose of about 
1 gram, depending upon the route of administratioa Guidance as to particular dosages and methods of 
delivery is provided in the literature and generally available to practitioners in the art Those skilled in 
the art will employ different formulations for nucleotides than for proteins or their inhibitors. Similarly, 
15 delivery of polynucleotides or polypeptides will be specific to particular cells, conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind RECAP may be used for the diagnosis 
of disorders characterized by expression of RECAP, or in assays to monitor patients being treated with 
RECAP or agonists, antagonists, or inhibitors of RECAP. Antibodies useful for diagnostic purposes 

20 may be prepared in the same manner as described above for therapeutics. . Diagnostic assays for RECAP 
include methods which utilize the antibody and a label to detect RECAP in human body fluids or in 
extracts of cells or tissues. The antibodies may be used with or without modification, and may be labeled 
by covalent or non-covalent attachment of a reporter molecule. A wide variety of reporter molecules, 
several of which are described above, are known in the art and may be used. - 

25 A variety of protocols for measuring RECAP, including ELIS As, RIAs, and FACS, are known . 

in the art and provide a basis for diagnosing altered or abnormal levels of RECAP expression. Normal or 
standard values for RECAP expression are established by combining body fluids or cell extracts taken 
from normal mammalian subjects, for example, human subjects, with antibody to RECAP under . 
. conditions suitable for complex formation. The amount of standard complex formation may be 

30 quantitated by various methods, such as photometric means. Quantities of RECAP expressed in subject, 
control, and disease samples from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding RECAP, may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 

35 complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and . . 
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quantify gene expression in biopsied tissues in which expression of RECAP may be correlated with 
: disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
RECAP, and to monitor regulation of RECAP levels during therapeutic intervention. - ' . 

In one aspect, hybridization' with PCR probes; which are capable of detecting polynucleoUde 
5 sequences, including genomic sequences, encoding RECAP or closely related molecules' may be used to 
^ ^identify nucleic acid sequences which encode REGAJV -The specificity of- the probe, -whether it is made _ 
from a highly specific region, e.g., the 5* regulatory region, or from a less specific region, e.g., a 
■conserved motif, and the stringency of the hybridizatioh or amplification will determine whether the probe 
identifies only naturally occurring sequences encoding RECAP, allelic variants, or related sequences. 
10 Probes may also be used for the detection of related sequences, and may have at least 50% 

sequence identity to ^ 

invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:23-44 or from 
genomic sequences including promoters, enhancers, and introns of the RECAP gene. 

Means for producing specific hybridization probes for DN As encoding R:EC AP include the 

15 cloning of polynucleotide sequences encoding RECAP or REC AP derivatives into vectors for the 

production of mRNA probes. Such vectors are known in the art, are commercially available, and may be 
used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA polymerases 
and the appropriate labeled nucleotides. . HybridizaUon probes may be labeled by a variety of reporter 
groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, such as alkaline 

20 phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

PolynucleoUde sequences encoding RECAP may be used for the diagnosis of disorders associated 
with expression of RECAP. Examples of such disorders include, but are not limited to, a neurological 
disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal 

25 disorders, Down's syndrome, amyotrophic lateral sclerosis and other motor neuron disorders, 

progressive neural muscular atrophy, reUniUs pigmentosa, hereditary ataxias, multiple sclerosis and 
other demyelinaUng diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural 
abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease; prion diseases including kuni, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker 

30 syndrome; fatal familial insomnia, nutritional and metabolic diseases of the nervous system, 

neurofibromatosis, tuberous sclerosis, cerebelloretinal hemahgioblastomatosis, enccphalotrigeminal 
syndrome, mental retardation and other developmental disorders of the central nervous system, cerebral 
palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 

35 dermatomyositis and polymyositis; inherited, metabolic," endocrine, and toxic myopathies; myasthenia 
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gravis, periodic paralysis; mental disorders including mood, anxiety, and schizophrenic disorders; 
seasonal affective disorder (SAD); akathesia, amnesia, catatonia, diabetic neuropathy, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 
supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; an immunological 
5 disorder, including autoimmune/inflammatory disorders, such as acquired immunodeficiency syndrome 
(AIDS), X-linked agammaglobinemia of Bruton, common variable immunodeficiency (CVI), 
DiGeorge's synclrome (thymic hypoplasia), thymic dysplasia, isolated IgA deficiency, severe combined 
immunodeficiency disease (SOD), immunodeficiency with thrombocytopenia and eczema (Wiskott- 
Aldrich syndrome), Chediak-Higashi syndrome, chronic granulomatous diseases, hereditary 

10 angioneurotic edema, and immunodeficiency associated with Cushing's disease, Addison's disease, 
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic 

15 lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout, Graves* disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's 
syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 

20 lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner 
syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 
fungal, parasitic, protozoal, and helminthic infections, hematopoietic cancers, including lymphoma, 
leukemia, and myeloma, and trauma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease (MCTD), 

25 myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary 

thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, 
sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, bone marrow, 
brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, 
ovary, pancreas, parathyroid, penis, prostate, salivary glands,' skin, spleen, testis, thymus, thyroid, and 

30 uterus. The polynucleotide sequences encoding RECAP may be used in Southern or northern analysis, 
dot blot, or other membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat 
ELISA-like assays; and in microarrays utilizing fluids or tissues from patients to detect altered RECAP 
expression. Such qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding RECAP may be useful in assays that 

35 detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
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sequences encoding RECAP -T^^ii^^^^ni^ and added to a fluid or tissue sample 
from a patient under conditions suitable for 1 the formation of hybridization complexes. After a suitable 
incubation period, the sample is washed and the signal is quantified and compared with a standard value. 
If uie amount of signal in the patient sample.is significantly altered in comparison to a control sample 
5 then the presence of altered levels of nucleotide sequences encoding RECAP in the sample indicates the 
presence of the associated disorder. Such assays may also be used to evaluate the efficacy of a particular 
th^apeuUq treaiment regimen in animal studies, in clinical trials, or ti> mord^ thc treatinent of ^ 

• individual patient. . . . ' • 

In order to provide a basis for the diagnosis of a disorder associated with expression of RECAP. 
10 a normal or standard profile for expression is established. This may be accomplished by combining body 
fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a fragment 
thereof, encoding RECAP, under conditions suitable for hybridization or amplification. Standard 
hybridization may be quantified by comparing the values obtained from normal subjects with values from 
an experiment in Which a-knbwn amount of a. substantially purified poiynucleotide is used. Standard 
15 values obtained in this manner may be compared with values obtained from samples from patients who 
are symptomatic for a disorder: Deviation from standard values is used to establish the presence of a 

disorder. 

K once the presence of a disorder is established and a treatment protocol is initiated, hybridization 
assays may be repeated on a regular basis to determine if the level of expression in the patient begins to 
20 approximate that which is observed in the normal subject The results obtained from successive assays 
may be used to show the efficacy of treatment over a period ranging from several days to months. 

With respect to cancer, the presenceof an abnormal amount of traiiscript (either una^- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development of 
the disease, or may provide a means for detecting the disease prior to the appearance of actual clinical 
25 symptoms. A more definitive diagnosis of this type may allow health professionals to employ 

preventative measures or aggressive treatment earlier thereby preventing the development or further 

progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequence 

may involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically, 
30 or produced in^. Oligomers will preferably contain a fragment of a polynucleotide encoding 

RECAP, or a fragment of a polynucleotide complementary to the polynucleotide encoding RECAP, and 
will be employed under optimized conditions for identification of a specific gene or condition. Oligomers 
may also beemployed under less stringent conditions for detection or quantification of closely related 

DNA or RNA sequences. - 
35 In aparticular aspect, oligonucleotide primers derived from the polynucleotide sequences 
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encoding RECAP may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease in 
humans. Methods of SNP detection include, but are not limited to, single-stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers 
5 derived from the polynucleotide sequences encoding RECAP are used to amplify DNA using the 
polymerase chain reaction (PCR). The DNA may be derived, for example, from diseased or normal 
tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the secondary 
and tertiary structures of PCR products in single-stranded form, and these differences are detectable 
using gel electrophoresis in non-denaturing gels. In fSCCP, the oligonucleotide primers are fluorescently 
10 labeled, which allows detection of the amplimers in high-throughput equipment such as DNA sequencing 
machines. Additionally, sequence database analysis methods, termed in silico SNP (isSNP), are capable 
of identifying polymorphisms by comparing the sequence of individual overlapping DNA fragments 
which assemble into a common consensus sequence. These computer-based methods filter out sequence 
variations due to laboratory preparation of DNA and sequencing errors using statistical models and 
15 automated analyses of DNA sequence chromatograms. In the alternative, SNPs may be detected and ^ 
characterized by mass spectrometry using, for example, the high throughput MASSARRAY system I 
(Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of RECAP include radiolabeling or • \ > ". . • 4 
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from .£ 
20 standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et 

al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be ? 4 
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 'k 
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 

25 In further embodiments, oligonucleotides or longer fragments derived from any of the 

polynucleotide sequences described herein may be used as elements on a microarray. The microarray can 
be used in traascript imaging techniques which monitor the relative expression levels of large numbers of 
genes simultaneously as described in Seilhamer, J.J. et al., "Comparative Gene Transcript Analysis," 
U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be used to 

30 identify genetic variants, mutations, and polymorphisms. This information may be used to determine 
gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
progression/regression of disease as a function of gene expression, and to develop and monitor the 
activities of therapeutic agents in the treatment of disease. In particular, this information may be used to 
develop a pharmacogenomic profile of a patient in order to select the most appropriate and effective 

35 treatment regimen for that patient. For example, therapeutic agents which are highly effective and 
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display the fewest side effects may fee selected for a' patient based on his/her pharmacogenoniic profile. 
" In another embodiment; antibodies specific for RECAP, or RECAP or fragments thereof may be 
: used as elements on a microarray. The microarray may be used tomonitor or measure protein-protein 
interactions, drug-target interactions, and gene expression profiles, as described above. 
5 A particular embodiment relates to the use of the polynucleotides of the present invention to 

generate- a transcript image of a tissue or cell type, A transcript image represents the global pattCT 
gene expression by a parUcular tissue or cell type. Global gene expression patterns>e analyzed by 
quanUfyihg the number of expressed genes and their ^ relauve abundance under given conditions and at a 
given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent Number 
10 5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of transcripts 
or reverse transcripts of a particular tissue or cell type. In one embodiment, the hybridization takes place 
in high-throughput format, wherein the polynucleotides of the present invention or their complements 
comprise a subset of a plurality of elements on a microarray. The resultant transcript image would 
15 provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 
or other biological samples. The transcript image may thus reflect gene expression in_yjyo, as in the case 
of a tissue or biopsy sample, or mvitrg, as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present invention 
20 may also be used in conjunction with |n_yjtro model systems and preclinical evaluation of 

pharmaceuticals, as well as toxicologic^ testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently. termed molecular 
fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity (Nuwaysir, 
E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson (2000) Toxicol. Lett. 112- 
25 1 1 3:467-471, expressly incorporated by reference herein). If a test compound has a signature similar to 
that of a compound with known toxicity, it is likely to share those toxic properties. These fingerprints or 
signatures are most useful and refined when they contain expression information from a large number of 
genes and gene families. Ideally, a genome-wide measurement of expression provides the highest quality 
signature. Even genes whose expression is not altered by any tested compounds are important as well, as 
30 the levels of expression of these genes are used to normalize the rest of the expression data. The 
normalization procedure is useful for comparison of expression data after treatment with different 
compounds: While the assignment of gene function to elements of a toxicant signature aids in 
interpretation of toxicity mechanisms, knowledge of gene function is not necessary for the statistical 
matching of Signatures which leads to prediction of toxicity." (See, lor example. Press Release 00-02 
35 from the National Institute of Environmental Health Sciences, released February 29, 2000, available at 
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http://www.nichs.nih.gov/c>c/news/toxchip.htm.) Therefore, it is important and desirable in toxicological 
screening using toxicant signatures to include all expressed gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 
containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 
5 biological sample are hybridized with one or more probes specific to the polynucleotides of the present . 
invention, so that transcript levels corresponding to the polynucleotides of the present invention may be 
quantified. The transcript levels in the treated biological sample are compared withjevels in an 
untreated biological sample. Differences in the transcript levels between the two, samples are indicative 
of a toxic response caused by the test compound in the treated sample. 

10 Another particular embodiment relates to the use of the polypeptide sequences of the present 

invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global pattern 
of protein expression in a particular tissue or cell type. Each protein component of a proteome can be 
subjected individually to further analysis. Proteome expression patterns, or profiles, are analyzed by 
quantifying the number of expressed proteins and their relative abundance under given conditions and at a 

15 given time. A profile of a cell's proteome may thus be generated by separating and analyzing the 
. polypeptides of a particular tissue or cell type. In one embodiment, the separation is achieved using two- 
dimensional gel electrophoresis, in which proteins from a sample are separated by isoelectric focusing in 
the first dimension, and then according to molecular weight by sodium dodecyl sulfate slab gel , v 
electrophoresis in the second dimension (Steiner and Anderson, supra) . The proteins are visualized in the 

20 gel as discrete and uniquely positioned spots, typically by staining the gel with an agent such as 
Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is generally ^ 
proportional to the level of the protein in the sample. The optical densities of equivalently. positioned 
protein spots from different samples, for example, from biological samples either treated or untreated 
with a test compound or therapeutic agent, are compared to identify any changes in protein spot density 

25 related to the treatment. The proteins in the spots are partially sequenced using, for example, standard 
methods employing chemical or enzymatic cleavage followed by mass spectrometry. The identity of the 
protein in a spot may be determined by comparing its partial sequence, preferably of at least 5 contiguous 
amino acid residues, to the polypeptide sequences of the present invention. In some cases, further 
sequence data may be obtained for definitive protein identification. 

30 A proteomic profile may also be generated using antibodies specific for RECAP to quantify the 

levels of RECAP expression. In one embodiment, the antibodies are used as elements on a microarray, 
and protein expression levels are quantified by exposing the microarray to the sample and detecting the 
levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 270:103-1 11; 
Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by a variety of 

35 methods known in the art, for example, by reacting the proteins in the sample with a thiol- or amino- 
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: reactive fluorescent compound and detecting the amount of fluorescence bound at each array element. . 
Toxicant signatures at the protebrne level are also useful for toxicological screening, and should 
be analysed in parallel wUh toxicant signatures^ :. 
between transcript and protein abundances for some proteins in some Ussues (Anderson ; N.L;andJ,. 
5 Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useM in the 
- analysis of compounds which danotsignificanUy.^ ^ : - . 

proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
degradaUon of rnRNA, so prbteom be more reliable and informative in such cases. . . . 

In another embodiment, the toxicity of a test compound is assessed by ueating a biological 
10 sample containing proteins with the test compound. Proteins that are expressed in the treated biological 
sample ^are separated so that the amount of each protein can be quantified. The amount of each protein is 
compared to the 'amount of the^esponding protein in in untreated biol^isainple.. A difference in 
the amount of protein between the'two samples is UKficative of a toxic response to the test wnipoui^ in 
me treated sample. Individual proteins are identified by sequencing the amino acid residues of the 
15 in^vidual proteins and collaring' th«epaWal.se^eiK«- to. the polypeptite of theVesent invent 
'• iaanou^-.mbodimeiil, the toxicity of a test compound is assessed by treating a biological' . • 
sample containing proteins with the test compound. Proteins from the biological sample are incubated 
.with anUbodies specific to the polypepUdes of the present invention, .The amount : 6f protein recognized ; 
by the anUbodies is quantified. The amount of protein in the treated biological sample is compared with 
20 me amount in ah untreated biological sample. A difference in the amount of protein between the two 
samples is indicative of a toxic response to the test compound in the treated sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art (See, e.g., 
Brennan, TM. et al. (1995) U.S. Patent No. 5.474,796; Schena, M. et al. (1996) Proc. NatJ. Acad. Sci. 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251 1 16; Shalon. D. et al. 
25 (1995) PCT appUcaUon WO95/35505; Heller, R.A. et al: (1997) Proc. Nafl. Acad. Sci. USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605.6620 Various types of microarrays are well 
known and thoroughly described in DNA Micrbarravs A Practical Approach, M. Schena, ed. (1999) 
Oxford University Press, London, hereby expressly incorporated by reference. / 

In another embodiment of the invention; nucleic acid sequences encoding RECAP may be used to 
30 generate hybridization probes useful in mapping me naturally occurring genomic sequence Either 
coding or noncodirig sequences may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members of 
a multi-gene family may potentially cause undesired cross hybridization during chromosomal mapping. 
; The' sequences may be mapped to a particular chromosome, to a specific region ofa chromosome, or to 
35 artificial chromosome construcUons, e.g., human artificial chromosomes (HACs), yeast artificial 
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chromosomes (Y AGs), bacterial artificial chromosomes (BACs), bacterial PI constructions, or single 
chromosome cDNA libraries. (See, e.g., Harrington, J J. et al. (1997) Nat. Genet. 15:345-355; Price, 
CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 7:149-154.) Once mapped, 
the nucleic acid sequences of the invention may be used to develop genetic linkage. maps, for example, 

5 which correlate the inheritance of a disease state with the inheritance of a particular chromosome region 
or restriction fragment length polymorphism (RFLP). (See, e.g., Lander, E.S. and. D. Botstein ( 1 986) 
Proc. Natl. Acad. Sci. USA 83:7353-7357.) . 
[' Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic map 

data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, su£^, pp. 965-968.) Exampl^ of geneUc map 

10 data can be found in various scientific journals or- at the Online Mendelian Inheritance in Man (OMIM) 
World Wide Web site. Correlation between the location of the gene encoding RECAP on a physical map 
and^a specific disorder, or a predisposition to a specific disorder^ may help define the region of DN A , 
associated with that disorder and thus may further positional cloning efforts. 

In_situ . hybridization of chromosomal preparations and physical mapping techniques, such as 

15 linkage analysis using established chromosomal markers, may beused.for extending genetic maps. Often . 4 
the placement of a gene on the chromosome of another mammalian species, such as mouse, may reveal ,?|| 
associated markers even if the exact chromosomal locus is not known. This information is valuable to ^ § 

investigators searching for disease genes using positional cloning or other gene discovery techniques. *f 
Once the gene or genes responsible for a disease or syndrome have been crudely localized by genetic r 

20 linkage to a particular genomic region, e!g., ataxia-telangiectasia to 11 q22-23, any sequences mapping to { 
that area may represent associated or regulatory genes for further investigation. (See, e.g., Gatti, R.A. et -:| 
al. (1988) Nature 336:577-580.) The nucleotide sequence of the instant invention may also be used to ~ v|| 

detect differences in the chromosomal location due to translocation, inversion, etc., among normal, / 
carrier, or affected individuals. . _ 

25 In another embodiment of the invention, RECAP, its catalytic or immunogenic fragments, or 

-oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug / 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellular^. The formation of binding complexes 
between RECAP and the agent being tested may be measured. , . _ 

30 Another technique for drug screening provides for high throughput screening of compounds 

having suitable binding affinity to the protein of interest. (See, e.g., Gcysen, et al. (1984) PCT 
application WO84/035 64.) In this method, large numbers of different small test compounds are 
synthesized on a solid substrate. The test compounds are reacted with RECAP, or fragments thereof, and 
washed. Bound RECAP is then detected by methods well known in the art. Purified RECAP can also be 

35 coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively, 
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noH-ncuuking andbodics can be used to capture the peptide and immobilize it on a solid support. 

in another embodiment, one may use competitive drug screening assays in which neutralizing ; 
- antibodies capable of binding RECAP specifically compete with a test compound for binding RECAP. 
In this manner, anUb«lies can be used to detect the presence of any pepUdew^ 

5 antigenic determinants with RECAP. v 

; ,n addiUonal embodiments, the nucleodde sequent 

molecular biology techniques that have yet to be develop 

properties of nucleotide sequences that are currently known, including, but not limited to, such properties 

as the triplet genetic code and specific base pair interactions. ; - 
10 Without further' elaboration, it is believed that one- skilled in the art can. using me preceding; . • . 

description, utilize the present invenUon to its fuHest extent. The following preferred specific 

embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder .. 

of the disclosure in any way whatsoever. 

■ The disclosures of all patents, applications/and publications mentioned above and below, in 
15 particular U.S. Ser. No. 60/145,232, U.S. Ser. No. 60/158,578, and U.S. Ser. No. 60/165,192. are 

hereby expressly incorporated by reference. 



• ' EXAMPLES 
Ii Construction of cDNA Libraries • 
20 • RNA was purchased from Clontech or isolated from tissues described in Table 4. Some tissues 
were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed in 
phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic 
solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl 
cushionsor extracted with chloroform. RNA was precipitated from the lysates with either isopropanol or 
25 sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA purity. 
In some cases,' RNA was treated with DNase. For most libraries, poly(A+)_ RNA was isolated using 
oligo d(T)-couplcd paramagnetic particles (Promega), OUGOTEX latex particles (QIAGEN, 
Chatsworlh CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
30 isolated directly from tissue lysates using other RNA isolation kits, e.g., the PGLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). ' . 

- In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system' (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies),' using the 
35 recommended procedures or similar tnethods known in the art. (See, e.g , Ausubel, 1997, m*£ units 
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5.1-6.6.) Reverse transcription was initiated using oligod(T) or random primers. .Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 
bp) using SEPHACRYL SI 000, SEPH AROSE CL2B, or SEPH AROSE CL4B column chromatography 
5 (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs were ligated into 
compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid 
(Stratagene), PSPORT1 plasmid (Life Technologies), pcDNA2.1 plasmid (Invitrogen, Carlsbad CA), or 
pINCY plasmid (Incyte Genomics, Palo Alto CA). Recombinant plasmids were transformed into 
competent E. coli cells including XLl-Blue, XLl-BlueMRF, or SOLR from Stratagene or DH5a, 
1 0 DH1 0B, or ElectroM AX DH 1 0B from Life Technologies. 
II. Isolation of cDNA Clones 

. . . Plasmids obtained as described in Example I were recovered from host cells by in vivo excision 
using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using at least one 
of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC 

15 Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 
8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 plasmid 
purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of distilled 
water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PGR in a high- 

20 throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal cycling 
steps were carried out in a single reaction mixture. Samples were processed and stored in 384-well 
plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a.FLUOROSKAN II fluorescence scanner 
(Labsystems Oy, Helsinki, Finland). 

25 III. Sequencing and Analysis * 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation such 
as the ABI CATALYST 800 (PE Biosystems) thermal cycler or the PTC-200 thermal cycler(MJ , 
Research) in conjunction with the HYDRA microdaspeaser (Robbins Scientific) or the MICROLAB 

30 2200 (Hamilton) liquid transfer system. , cDNA sequencing reactions were prepared using reagents . - 

provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as the ABI PRISM 

BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). Electrophoretic separation of 

, cDNA sequencing reactions and detection of labeled poly nucleotides, were carried out using the 
* * ■ 

. MEG AB ACE 1000 DNA sequencing system (Molecular Dynamics); the ABI PRISM 373.or 377 

35 sequencing system (PE Biosystems) in conjunction with standard ABI protocols and base calling , 
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software; or other sequence analysis systems known in the art; Reading frames within the cDNA 
• sequences were identified using standard methods (reviewed in Ausubel, 1997, suETJ. unit 7.7). Some of. 
the cDNA sequences: were selected for extension using the techniques disclosed in Example V. 

The polynucleotide sequences derived from cDNA sequencing were assembled arid analyzed 
5 using a combination of software programs which utilize algorithms well known to those skilled in the art. 
TableS summarizes the tools, programs, and algorithms used and.prpvides applicable descripdoas, . ^ 
references, and threshold parameters. The first column of Table 5 shows the tools, programs, arid 
" algorithms used, the second column provides brief descriptions thereof, the third column presents: _ 

appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth 
10 column presents, where applicable, the scores, probability values, and other parameters used to evaluate 
the strength of a match between two sequences (the higher the score, the greater the homology between 
two sequences). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software 
Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and 
polypeptide sequence alignments were generated using the default parameters specified by the clustal 
15 algorithm as incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which 
also calculates the percent identity between aligned sequences. 

The polynucleotide sequences were validated by removing vector, linker, .and pdyA sequences' 
and by masking ambiguous bases, using algorithms and programs based on BLAST, dynamic 
programing, and dinucleotide nearest neighbor analysis. The sequences were then queried against a 
20 selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and ^ 
eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation 
using programs based on BLAST, FAST A, and BLIMPS^ The sequences were assembled intofull 
length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and were screened 
for open reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 
25 polynucleotide sequences were translated to derive the corresponding full length amino acid sequences, 
and these full length sequences were subsequently analyzed by querying against databases such as the 
GenBank databases (described above). SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, and 
Hidden Markov Model (HMM)-based protein family databases such as PFAM. HMM is a probabilistic 
approach which analyzes consensus primary structures of gene families. (See, e.g., Eddy, S.R. (1996) 

30 Curr. Opin. Struct. Biol. 6:361-365.) 

The programs described above for the assembly and analysis of full length polynucleotide and 
amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID NO:23- 
44. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies were described in The Invention section above. . 

35 IV. Analysis of Polynucleotide Expression 
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Northern analysis is a laboratory technique.used to. detect the presence of a transcript of a gene 
and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a 
particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel, 1995, supra , 
ch. 4 and 16.). .... f ; 

5 Analogous-computer techniques applying BLAST were used, to search for identical or related . 

molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is much 
faster than multiple membrane-based hybridizations. Tn addition, the sensitivity of the computer search 
can be modified to determine whether any particular match is categorized as exact or similar., The basis 
of the search is the product score, which is defined as: . f . ... ^, 

10 , . \ BLAST Score x Percent Identity 

. 5 x minimum {length(Seq. 1), length(Seq. 2)} 

The product score takes into account both the degree of similarity between two sequences and the length 
of the sequence match. _The product score is a normalized value between 0 and 100, and is calculated as 
15 follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided by 
(5 times the length of the shorter of the two sequences). The BLAST score is calculated by assigning a 
score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for every mismatch. 
Two sequences may share more than one HSP (separated by gaps). If there is morethan one HSP? then 
the pair with the highest BLAST score is used to calculate the product score. The product score - 

20 represents a balance between fractional overlap and quality in a BLAST alignment. For example, a 
product score of 100 is produced only for 100% identity over the entire length of the shorter of the two 
sequences being compared. A product score of 70 is produced either by 1 00% identity and 70% overlap 
at one end, or by 88% identity and 100% overlap at the other. A product score of 50 is produced either 
by 100% identity and 50%,overlap at one end, or 79% identity and J00% overlap. 

25 The results of northern analyses are reported as a percentage distribution of libraries in which the 

transcript encoding RECAP occurred. Analysis involved the categorization of cDNA-libraries by * 
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologies 
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, r 
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, cell 

30 proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table 3. 
V. Chromosomal Mapping of RECAP Encoding Polynucleotides : ~ : 

The cDNA sequences which were used to assemble SEQ ID NO:23-44 were compared with 

35 sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other \ 
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implementations of the Smith- Waterman algorithm. Sequences from these databases that matched SEQ 
ID NO:23-44 were assembled into clusters of contiguous and overlapping sequences using assembly 
algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available from public 

: : resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for Genome 
5 Research (WIGR), and G6n£thon were used to determine if any of the clustered sequences had been 
— —^ previously mapped.- Inclusion of-a-mapped sequence in a cluster resulted in the assignmenVof all- - - 
sequences of that cluster, including its particular SEQ ID NO:, to that map location. 
. '' ^ r ^ ;:;'i-y'v % Genetic 'map locations are reported as ranges, or intervals, of humim c^ 
The map position of an interval, in centiMorgaris, is measured relative to the terminus of the 

10 chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination 
frequencies between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase 
(Mb) of DN A in humans, although this can vary widely due to hot and cold spots of recombination.) 
The cM distances are based on genetic markers mapped by G6n6thon which provide boundaries for 
radiation hybrid markers whose sequences were included in each of the clusters. SEQ ID NO:24 maps 

15 to chromosome 1 within the interval from 12.8 to 22.9 centiMorgans. SEQ ID NO:36 maps to 
chromosome 1 within the interval from 74.8 to 78.3 centiMorgans. 
VI. Extension of RECAP Encoding Polynucleotides 

The full length nucleic acid sequences of SEQ ID NO:23-44 were produced by extension of an 
appropriate fragment of the full length molecule using oligonucleotide primers designed from this 

20 fragment. One primer was synthesized to initiate 5 * extension of the known fragment, and the other 
primer, to. initiate 3* extension of the known fragment. The initial primers were designed using OLIGO 
4.06 software (National. Biosciences), or another appropriate program, to be about 22 to 30 nucleotides 
in length, to have a GC content of about 50% or more, and to anneal to the target sequence at 
temperatures of aboui 68 °C to about 72 °C Any stretch of nucleotides which would result in hairpin 

25 structures and primer-primer dimerizations was avoided/ : 

, Selected human cDNA libraries were used to extend the sequence. If more than one extension 
was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR was 
performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction mix 

30 contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2+ , (NTH 4 ) 2 S0 4f and P- 
mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme (Life . 
Technologies), and Pfu DNA polymerase (Stratagene),: with the following parameters for primer pair PCI 
A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 2 min; 
Step 5: Steps. 2, 3, and 4 repeated 20 times; Step 6: 68 °C, 5 niin; Step 7: storage at 4°C. In the 

35 alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2: . 
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94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68°C, 5 min; : Step 7: storage at 4°C. ;. 

The concentration of DNA in each well was, determined by dispensing 1 00 pi PICOGREEN 
quantitation reagent (0.25% (y/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE and 
5 0.5 |il of undiluted PCR product into each well of an opaque fiuorimeter plate (Corning Costar, Acton 
MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II (Labsystems 
Oy ( Helsinki, Finland) to measure the fluorescence of the sample and to quantify the concentration of 
DNA. A 5 tA to 10 y\ aliquot of the reaction mixture was analyzed by electrophoresis on a 1 % agarose 
mini-gel to determine which reactions were successful in extending the sequence. 
10 The extended nucleotides were desalted and concentrated, transferred to 384-well plates, digested 

with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and sonicated or 
sheared prior to religation into pUC 1 8 vector (Amersham Pharmacia Biotech). For shotgun sequencing, 
the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose gels, fragments were 
excised, and agar digested with Agar ACE (Promega). Extended clones were religated using T4 ligase 
15 (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham Pharmacia Biotech), treated with 
Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, and transfected into competent E. 
coH cells. Transformed cells were selected on antibiotic-containing media, and individual colonies were 
picked and cultured overnight at 37 °C in 384- well plates in LB/2x carb liquid media. :: 

The cells were lysed, and DNA was amplified by PCR.using Taq DNA polymerase (Amersham 
20 Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step"! 94°C, 
3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, and 4" 
repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples with low DNA recoveries were reamplified 
using the same conditions as described above. Samples were diluted with 20% dimethysulfoxide (1 :2, 
25 v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC 

DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle sequencing 

ready reaction kit (PE Biosystems). • : ' . • . : 

In like manner, the polynucleotide sequences of SEQ ID NO:23-44 are used to obtain 5' 
regulatory sequences using the procedure above, along with oligonucleotides designed for such extension, 
30 and an appropriate genomic library., r. , - _ f . . . , - 

VII. Labeling and Use of Individual Hybridization Probes . ■ ■ ' . 

Hybridization probes derived from SEQ ID NO:23-44 are employed to screen cDN As, genomic 
DNAs, or mRNAs. Although the labeling of oligonucleoUdes, coasisting of about 20 base pairs, is 
specifically desCTibed, essentially the same procedure is used with larger nucleotide fragments. 
35 Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National ' 
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Biosciences) and labeled by combining 50 pmol of each oligomer, 250 ^Ci of [y- 32 P] adenosine 

triphosphate (Amer sham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston MA). 

The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine size 
- exclusion dextran bead column (Amersham Pharmacia Biotech), An aliquot containing 10 7 counts per 
5 minute of the labeled probe is used in' a typical membrane-based hybridization analysis of human 
—h genomic DNA digested with one of the following endonucleasesi- Ase I t Bgl.II,:Eco RI; Pst I, Xba I, or _ 

Pvu II (DuPont NEN). 

v The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 1 6 hours 
10 at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature under 
conditions of up to,' for example, 0. 1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
■ compared. : ' * \ ' ' • \ ,_ 

VIII. Microarrays 

15 The linkage or synthesis of array elements upon a microarray can be achieved utilizing 

photolithography, piezoelectric printing (ink-jet printing, See, e.g. ( Baldeschweiler, su^ra), mechanical 
microspotting technologies, and derivatives thereof. The substrate in each of the aforementioned 
technologies should be uniform and solid with a non-porous surface (Schena (1 999), supra). Suggested 
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure 

20 analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a substrate 
using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced using 
available methods and machines well known to those of ordinary skill in the art and may contain any 
appropriate number of elements. (See, e.g., Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et 
al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.) 

25 * Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The array 
elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. After 

30 hybridization, nonhybridized nucleotides from the biological sample are removed, and a fluorescence 
scanner is used to detect hybridization at each array element. Alternatively, laser desorbtion and mass 
spectrometry may be used for detection of hybridization. The degree of complementarity and the relative 
abundance of each polynucleotide which hybridizes to an element on the microarray may be assessed. In 
one embodiment, microarray preparation and usage is described in detail below. . „ 

35 Tissue or Cell Sample Preparation 
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Total RNA is isolated from tissue samples using theguanidinium thiocyanate method and 
poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly(A) + RNA sample is . 
reverse transcribed using MMLV reverse-transcriptase,. 0.05 pg/pl oligo-(dT) primer (21mer), IX first 
strand buffer, o;03 units/pl RNase inhibitor, 500 jiM.dATP, 500 dGTP, 500 dTTP, 40 nM 
5 dCTP, 40 dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng polyfA)* RNA with . 
GEMBRIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37°C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and . 

10 incubated for 20 minutes at 85 °C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended 

15 in 14 Ml 5X SSC/0.2% SDS. . . . . ^ . 

Microarray Preparation 

Sequences of the present invention are used to generate array elements. Each array element is 
amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert Array elements are 
20 amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 |ig. 
Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia Biotech). 

Purified array elements are immobilized on polymer^coated glass slides. Glass microscope 
slides (Corning) are cleaned by ultrasound in 0. 1 % SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR Scientific 
25 Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and coated 
with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C oven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated herein by reference. 1 |il of the array element DNA, at an average 
concentration of 100 ng/pl, is loaded into the open capillary printing element by a high-speed robotic 
30 apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crossl inker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60 °C followed by washes in 0.2% 
35 SDS and distilled water as before. 
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; Hybridization ' "' ' .' " ' '■ 

Hybridization reactions contain 9 ul of sample mixture consisting of 0.2 ug each of Cy3 and 

■ Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridizaUon buffer. The sample 
mixture is heated to 65*0 for 5 minutes arid is aliquoted onto the microarray surface and covered with 

5 an 1.8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly; 

■ larger than a microscope slide. The'chamber is kept at-100% humidity internally by the addition of 140 
ul of 5X SSC in a corner of the chamber. The chamber 

hours at60°C. The arrays are washed for 10 min at 45 °C in a first wash buffer (IX SSC, 0.1 %-SDS). 
three times for 10 minutes each at 45°C in a second wash buffer (0.1X SSC), and dried. ^ 

10 Detection ;■' •• " ' ~ 

Reporter-labeled hybridizaUon complexes are detected with a microscope equipped with an . 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) Capable of generating spectral lines 

v at488 hm for excitation of Cy3 and at 632 nm for excitation of 

focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 

15 containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1 .8 cm x 1.8 cm array used in the present example is scannedwith a 

resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two;fluorophores sequentially; .v 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
20 Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophbres. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of me fluorophores used are 565 hm for Cy3 arid 650 nm for Cy5. Each array is 
typically scanned twice, one scan' per fluorophore using the appropriate filters at the laser source, 
, although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 
25 - The sensitivity of the scans is typically calibrated using the signal intensity generated by a ; 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence/allowing the intensity of the signal at that location 
to be correlated with a weight ratio of hybridizing species of 1 : 100,000. When two samples from 
different sources (e.g., representing test and control cells), each labeled with a different fluorophore. are 
30 hybridized to a single array for the purpose of identifying genes that are differentially expressed, the 
calibration is done by labeling samples of the calibrating cDNA with the two fluorophores and adding 
identical amounts of each to the hybridization mixture. 

. The output of me photomultiplier tube is digitized 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
35 computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
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linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high . . 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission 
spectra) between the fluorophores using each iluorophore's emission spectrum. 
5 - A grid is superimposed over the fluorescence signal image such that the signal from each spot 

is centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used for 
signal analysis is the GEMTOOLS gene expression analysis program (Incyte). ^ .... 

IX. Complementary Polynucleotides 

10 Sequences complementary to the RECAP-encoding sequences, or any parts thereof, are used to 

detect, decrease, or inhibit expression of naturally occurring RECAP. Although use of oligonucleotides 
comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with : 
smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 4.06 
software (National Biosciences) and the coding sequence of RECAP.. To inhibit transcription, a 

15 complementary oligonucleotide is designed from the most unique 5' sequence and used to prevent 
promoter binding to the coding sequence. , To inhibit translation, a complementary oligonucleotide is 
designed to prevent ribosomal binding to the RECAP-encoding transcript. 

X. Expression of RECAP v? 
Expression and purification of RECAP is achieved using bacterial or virus-based expression 

20 systems. For expression of RECAP in bacteria, cDN A is subcloned into an appropriate vector^ 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g : , BL21(DE3). Antibiotic 

25 resistant bacteria express RECAP upon induction with isopropyl beta-D-thiogalactopyranoside (IPTG). 
Expression of RECAP in eukaryotic cells is achieved by infecting insect or mammalian cell lines with 
recombinant Autograohica californica nuclear polyhedrosis virus (AcMNPV), commonly known as 
baculovirus. The nonessential polyhedfin gene of baculovirus is replaced with cDN A encoding RECAP 
by either homologous recombination or bacterial-mediated transposition involving transfer plasmid 

30 intermediates. Viral infectivity is maintained and the strong polyhedrin promoter drives high levels of . 
cDNA transcription. Recombinant baculovirus is used to infect Spodontera frugiperda (Sf9) insect cells 
in most cases, or human hepatocytes, in some cases. Infection of the latter. requires additional genetic 
modifications to baculovirus. (See Engelhard,. E.K. et al. (1994) Proc. Natl. Acad. Sci. USA 9 1:3224- 
3227; Sandig, V. et at. (1996) Hum. GeneTher. 7:1937-1945.) 

35 In most expression systems, RECAP is synthesized as a fusion protein with, e.g., glutathione S- 
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transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, .single-step, 
affinity based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosoma iaponicum . enables the purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 

5 Biotech). Following purification, the GST moiety can be proteolytically cleaved from RECAP at 
1 specifically engineered sites FLAG, an 8-amino acid peptide, enables immunoaffinity-purification using 
commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a 
stretch of six consecutive histidine residues, enables purification on metal-chelate resins (QIAGEN). 
Methods for protein expression and purificaUon are discussed in Ausubel (1995, supra, ch. 10 and 16). 

10 Purified RECAP obtained by these methods can be used directly in the assays shown in Examples XI and 

: " '■ xv. ; ; - ; ■ \; , . * ... 

XI. Demonstration of RECAP Activity 

Receptor activity of RECAP is determined in a ligand-binding assay using candidate ligand 
molecules in the presence of 125 I-labeled RECAP. RECAP is labeled with 125 I Boltpn-Hunter reagent. 

15 (See, e.g., Bolton, A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539). Candidate ligand molecules 
previously arrayed in the wells of a multi-well plate are incubated with the labeled RECAP, washed, and 
any wells with labeled RECAP complex are assayed. Data obtained using different concentrations of 
RECAP are used to calculate values for the number, affinity, and association of RECAP with the ligand 
molecules. The level of binding measured is proportional to the level of RECAP activity. 

20 In the alternative, activity of RECAP may be measured using an assay based upon the 

property of some GPCRs to support the in vitro proliferation of fibroblasts and tumor cells under 
serum-free conditions (Chiquet-Ehrismann, R, et al. (1986) Cell 47:131-139). Wells in 96 well cluster 
plates (Falcon, Fisher Scientific, Santa Clara C A) are coated with RECAP by incubation with solutions 
at 50-100 ng/nil for 15 min at ambient temperature. The coating solution is aspirated, and the wells 

25 washed with Dulbecco's medium before cells are plated. Rat fibroblast cultures or rat mammary tumor 
cells are prepared as described and plated at a density of 10 4 -10 5 cells/ml in Dulbecco's medium 
supplemented with 10% fetal calf serum (FCS). 

After three days the media are removed, and the cells washed three times with phosphate- 
buffered saline (PBS) before the addition of serum-free Dulbecco's medium containing 0.25 mg/ml 

30 bovine serum albumin (BS A, Fraction V, Sigma Chemical, St. Louis, MO). After 2 days the medium 
is aspirated, and 100 pi of [ 3 H]thymidine (NEN) at 2 pCi/ml in fresh Dulbecco's medium containing 
0.25 mg/ml BSA added. Parallel plates are fixed and stained to determine cell numbers. After 16 hr, 
the medium is aspirated, the cell layer washed with PBS, and the 10% trichloroacetic acid-precipi table 
counts in the cell layer determined by liquid scintillation counting of radioisotope (normalized to 

35 relaUve cell numbers; Chiquet-Ehrismann, R. et al. (1986) supra). The rates of cell proliferation and 
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[ 3 H]thymidine uptake are proportional .to the activity of RECAP in the sample. 

In the alternative, the assay for RECAP activity is based upon the property of CD97/Emrl 
GPCR family proteins to modulate G protein-activated second messenger signal transduction pathways 
(e.g., cAMP; Gaudin, P., et al. (1998) J. Biol. Chem., 273:4990-4996). A plasmid encoding full length 
5 RECAP is transfccted into a mammalian cell line (e.g., COS-7 or Chinese hamster ovary (CHOrKl) 
cell lines) using methods well-known in the art. Transfected cells are grown in 12-well trays in culture 
medium containing 2% FCS for 48 hours, the culture medium is discarded, then the attached cells are 
gently washed with PBS. The cells are then incubated in culture medium with 10% FCS or 2% FCS 
for 30 minutes, then the medium is removed and cells lysed by treatment with 1 M perchloric acid. 

10 The cAMP levels in the lysate are measured by radioimmunoassay using methods well-known in the 
art. Changes in the levels of cAMP in the lysate from 10% FCS-treated cells compared with those in 
2% FCS-treated cells are proportional to the activity of RECAP present in the transfected cells. 

In another alternative, an assay for RECAP activity is based on a prototypical assay for 
ligand/receptor-mediated modulation of cell proliferation. This assay measures the rate of DNA 

15 synthesis in Swiss mouse 3T3 cells. A plasmid containing polynucleotides encoding RECAP is added to 

quiescent 3T3 cultured cells using transfection methods well known in the art The transiently ' d 
transfected cells are then incubated in the presence of [ 3 H]thymidine, a radioactive DNA precursor 4 
molecule. Varying amounts of RECAP ligand are then added to the cultured cells. Incorporation of v 
[ 3 H]thymidine into acid-precipitable DNA is measured over an appropriate time interval using a I 

20 radioisotope counter, and the amount incorporated is directly proportional to the amount of newly 

synthesized DNA. A linear dose-response curve over at least a hundred- fold RECAP ligand 

' • • • ^ 

concentration range is indicative of receptor activity. One unit of activity per milliliter is defined as the ^ 

concentration of RECAP producing a 50% response level, where 100% represents maximal incorporation 

of [ 3 H]thymidine into acid-precipitable DNA (McKay, I. and Leigh, I., eds. (1993) Growth Factors: A 

25 Practical Approach . Oxford University Press, New York, NY, p. 73.) 

In the alternative, the assay for RECAP activity is based upon the ability of GPCR family 
proteins to modulate G protein-activated second messenger signal transduction pathways (e.g., cAMP; 
Gaudin, P. et al. (1998) J. Biol. Chem. 273:4990-4996). A plasmid encoding full length RECAP is 
transfected into a mammalian cell line (e.g., Chinese hamster ovary (CHO) or human embryonic kidney 

30 (HEK-293) cell lines) using methods well-known in the art. Transfected cells are grown in 12-well 
trays in culture medium for 48 hours, then the culture medium is discarded, and the attached cells are 
gently washed with PBS. The cells are then incubated in culture medium with or without ligand for 30 
minutes, then the medium is removed and cells lysed by treatment with 1 M perchloric acid. The 
cAMP levels in the lysate are measured by radioimmunoassay using methods well-known in the art. 

35 Changes in the levels of cAMP in the lysate from cells exposed to ligand compared to those without 
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ligand are proportional to the amount of RECAP present in the transfected cells. 
XIL Functional Assays 

RECAP function is assessed by expressing the sequences encoding RECAP at physiologically 
elevated levels in mammalian cell culture systems. cDN A is subcloned into a mammalian expression 

5 vector containing a strong promoter that drives high levels of c'DNA expression. Vectors of choice 
- - include pCMV-SPORT (Life Technoto^^ CA),_both of which J ; _ : 

contain the cytomegalovirus promoter. 5-10 /ig of recombinant vector are transiently transfectcd into a 
human cell line, for example, an endothelial or hematopoietic cell line, using either liposome formulations 
or electroporation. 1-2 jag of an additional plasmid containing sequences encoding a marker protein are 

10 co-transfected Expression of a marker protein provides a means to distinguish transfected cells from 
nontransfected cells and is a reliable predictor of cDNA expression from the recombinant vector. Marker 
proteins of choice include, e.g., Green Fluorescent Protein (GFP; Clontech), CD64, or a CD64-GFP 
fusion proteia Row cytometry (FCM), an automated, laser optics-based technique, is used to identify 
transfected cells expressing GFP or CD64-GFP and to evaluate the apoptotic state of the cells and other 

15 cellular properties. FCM detects and quantifies the uptake of fluorescent molecules that diagnose events 
preceding or coincident with cell death. These events include changes in nuclear DNA content as 
measured by staining of DNA with propidium iodide; changes in cell size and granularity as measured by 
forward light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as measured by 
decrease in bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular proteins 

20 as measured by reactivity with specific antibodies; and alterations in plasma membrane composition as 
measured by the binding of fluorescein-conjugated Annexin V protein to the cell surface. Methods in 
flow cytometry are discussed in Ormerod, M.G. (1994) FTow^Cytometry, Oxford, New York NY. 

The influence of RECAP on gene expression can be assessed using highly purified populations of 
cells transfected with sequences encoding RECAP and either CD64 or CD64-GFP. CD64 and CD64- 

25 GFP are expressed on the surface of transfected cells and bind to conserved regions of human 

immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success NY). 
mRNA can be purified from the cells using methods well known by those of skill in the art. Expression 
of mRNA encoding RECAP and other genes of interest can be analyzed by northern analysis or 

30 microarray techniques. ; 

XIII. Production of RECAP Specific Antibodies 

RECAP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1 990) Methods Enzymol. 1 82;488r495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

35 - Alternatively, the RECAP amino acid sequence is analyzed using LASERGENE software 
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(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods few- 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in ,the art. (See, e.g., Ausubel. 1995. supra , cIl ll.V 
5 Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A ^ 

peptide synthesizer (PE Biosystems) using FMOC chemistry and coupled to KLH (Sigma- Aldrich, St. 
Louis MO) by. reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 
immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the oligopeptide- KLH 
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-RECAP 
10 activity by, for example, binding the peptide or RECAP to a substrate, blocking with 1% BSA, reacting 
with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

XIV. Purification of Naturally Occurring RECAP Using Specific Antibodies 
Naturally occurring or recombinant RECAP is substantially purified by immiinoaffinity 

chromatography using antibodies specific for RECAP. An immunoaffinity column is constructed by 
15 covalently coupling. anti-RECAP antibody to an activated chromatographic resin, such as ; 

CNBr-activated SEPH AROSE (Amersham Pharmacia Biotech). After the coupling, the resin is blocked 

and washed according to the manufacturer's instructions. 

Media containing RECAP are passed over the immunoaffinity column, and the column is^washed 

under conditions that allow the preferential absorbance of RECAP (e.g., high ionic strength buffers in the 
20 presence of detergent). The column is eluted under conditions that disrupt antibody/REC AP binding 

(e;g., a buffo- of pH 2 to pH 3, or a high concentration of a chaotrope, such as urea or thiocyanate ion), 

and RECAP is collected. 

XV, Identification of Molecules Which Interact with RECAP 

RECAP, or biologically active fragments thereof, are labeled with l25 l Boltpn-Hunter reagent] 
25 (See, e.g., Bolton A.E. and W.M. Hunter (1973) Biochem. J. 1 33:529-539.) Candidate molecules 

previously arrayed in the wells of a muld-well plate are incubated with the labeled RECAP, washed, and 
any wells with labeled RECAP complex are assayed. Data obtained using different concentrations of 
RECAP are used to calculate values for the number, affinity, and association of RECAP with the 
candidate molecules. •■ 
30 Alternatively, molecules interacting with RECAP are analyzed using the yeast two-hybrid . 

system as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

RECAP may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
35 between the proteins encoded by two large libraries. of genes (Nandabalan, K. et al. (2000). U.S. Patent 
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Various modifications and variations of the described methods and systems of the invention will 
be apparent to those skilled in the art without deparUng from the scope and spirit of the inven 
5 Although the invention has been described in connection with certain embodiments, it should be 
- uncferstood that the inveridoli a<claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which are obvious to 
those skilled in molecular biology or related fields are intended to be within the scope of the following 
claims. -.- . ' ' , " ' " • . ./ . • ■ 
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What is claimed is: " , , 

1 . An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO:l , SEQ ID NO:2, 

SEQ ID NO:3, SEQ ID NOi4, SEQ ID NO:5; SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID- 
NO:9, SEQ ID NO: 10, SEQ ID NO:ll, SEQ ID NO; 12, SEQ ID NO:14, SEQ ID NO:15, SEQ ID 
NO:16, SEQ ID NO:17, SEQ IDNO:18, SEQ ID NO:19, SEQ ID NO:20, and SEQ ID NO:22, 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
10 amino acid sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID 

NO:3. SEQ ID NO:4, SEQ ID NO :5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, 
SEQ ID NO:10, SEQ ID NO:ll, SEQ ID NO: 12, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16. 
SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, and SEQ ID NO:22, 

c) a biologically active fragment of an amino acid sequence selected from the group consisting 
15 of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID 

NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID,NO:10, SEQJD NO: IT, 'SEQ ID NO:i2,,SEQ ID 
NO:14,.SEQIDNO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO: 1 8, SEQ ID NO:l 9, SEQ ID 
NO:20; and SEQ ID NO:22, arid 

d) an immunogenic fragment of an amino acid sequence selected from the group consisting of 
20 SEQ ID NO 1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID 

NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NOilO, ^^SEQ ID NO:ll, SEQ ID NO:12, SEQ ID 
NO:14, SEQ IDNO:15, SEQ ID'NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO: 19, SEQ ID 
NO:20, and SEQ ID NO:22. 

25 2/ An isolated polypepUde of claim 1 selected from the group consisting of SEQ ID NO: 1, 

seq id no:2, seq id no:3, seq id no:4, seq id no:5, seq id no:6, seq id no:7. seq id 
no:8, seq id no:9, seq id no:10, seq id no:l 1 , seq id no:12, seq id no:14, seq id 
no:15, seq id no: 1.6, seq id no:17, seq id no: 18, seq id no:19, seq id no:20, and seq id 
no:22. ; : " ' _ ■ ; ' ".' \ /. '..:'.... -/ ' ' : : 

30 , '.' ' .'' ■'. ... ' • , '• : 

3. An isolated polynucleotide encoding a polypeptide of claim 1.. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

35 5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID NO:23, 
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SEQ ID NO:24 t SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, 
SEQ ID NO;30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:36, 
SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, and 
SEQIDNO:44. 

5 ; . ' . . , ... - 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 



10 



7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 



9. A method for producing a polypeptide of claim l f the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 

15 cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide comprises 

a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 1 f and ||| 

b) recovering the polypeptide so expressed. -•■ "- *3Sf 

■ . • . . ■ - ■ ' . - ^ '" : - < • 

... . ■ ".4- fx 

10. An isolated antibody which specifically binds to a polypeptide of claim 1. > _ : M 

1 1. An isolated polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of: * 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID 

25 NO:30, SEQ ID NO:31 f SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:36, SEQ ID 

NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, and SEQ ID * 
NO:44, ' .' . ' ......... 

b) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:23, SEQ ID NO:24, SEQ 

30 ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29. SEQ ID NO:30, SEQ ID 
NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:37, SEQ ID 
NO:38, SEQ ID NO:39, SEQ ID NO:40. SEQ ID NO:41 , SEQ ID NO:42, and SEQ ID NO:44, 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 
35 e) an RNA equivalent of a)-d). 

88 
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12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a , 
polynucleotide of claim 11. 

13. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
5 having a sequence of a polynucleotide of claim 1 1 /the method comprising: . 

— a) hybridizing the sample with a probe comprising at least 26cpntiguousnucleotides v 

comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization complex 
is formed between said probe and said target polynucleotide or fragments thereof, and , 
10 b) detecting the presence or absence of said hybridization. complex, and, optionally, if present, 

the amount thereof •". 

14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides. 

15 is. A method for detecting a target polynucleotide in a sample, said target polynucleotide 

having a sequence of a polynucleotide of claim 11, the method comprising: 

' a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
20 thereof, and, optionally, if present, the amount thereof. 

16. A pharmaceutical composition comprising an effective amount of a polypeptide of claim 1 
and a pharmaceutical^ acceptable excipient. 

25 17. A pharmaceutical composition of claim 16, wherein the polypeptide comprises an amino 

1 acid sequence selected from the group consisting of SEQ ID NO:l, SEQ IP NO:2, SEQ ID NO:3, SEQ 
ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, 
SEQ IDNO:ll,SEQID NO:12, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO: 16, SEQ ID NO:17, 
SEQIDNO:18,SEQIDNO:19,SEQIDNO:20,andSEQIDNO:22. V 

30 ■ ' ■. • " .7 - - ' •. v . ';. : 7 • ■:. , ■. .'■ 

18. A method for treating a disease or condition associated with decreased expression of 
functional RECAP, comprising administering to a patient in need of such treatment the pharmaceutical 
composition of claim 16. _ .. ', 

35 19. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
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claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and _ 

b) detecting agonist activity in the sample. 

5 20. A pharmaceutical composition comprising an agonist compound identified by a method of 

claim 19 and a pharmaceutical^ acceptable excipient 

21. A method for treating a disease or condition associated with decreased expression of 
functional RECAP, comprising administering to a patient in need of such treatment a pharmaceutical 

10 composition of claim 20. 

22. A method for screening a compound for effectiveness as an antagonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 
15 b) detecting antagonist activity in the sample. 

~ ■ . ■ ' ' * 

23. A pharmaceutical composition comprising an antagonist compound identified by a method r 
of claim 22 and a pharmaceutical^ acceptable excipient - : u 

20 24, A method for treating a disease or condition associated with overexpression of functional f 

RECAP, comprising administering to a patient in need of such treatment a pharmaceutical composition 
of claim 23. . 5& 

25. A method of screening for a compound that specifically binds to the polypeptide of claim 
25 1, said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
compound that specifically binds to the polypeptide of claim 1 . 

30 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, said method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1 , 
35 b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, 

90 
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• ■ ■ . ■'?:_ c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound : 
with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change in 
theactivity of the polypeptide of claim l in the presence ofthe test compouhd is indicative of a 
5 compound that modulates the activity of the polypeptide of claim 1 . 

27. A method for screening a compound for effectiveness in altering expression of a target 
• . polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 

comprising: ■ ■ ■. 
10 a) exposing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 

•' • . . 28. A method for assessing toxicity of a test compound, said method comprising: 
a) treating a biological sample containing nucleic acids with the test compound; 

15 ^hybridizing 

least 20 contiguous nucleotides of a polynucleotide of claim 1 1 under conditions whereby a specific . ; 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 
11 or fragment thereof; x 

20 c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the amount 
of hybridization complex in the treated biological sample is indicative of toxicity of the test compound. 
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SEQUENCE LISTING 



<110> INCYTE genomics; INC. 
AU-YOUNG, Janice 
. BANDMAN, Olga 
TANG, Y. Tom 
YUE, Henry 
AZIMZAI, Yalda 
BURFORD, Neil 
BAUGHN, Mariah R. 
LU, Dyung Aina M. 
HILLMAN, Jennifer L. 
PATTERSON, Chandra 
LAL, Preeti 



<120> RECEPTORS AND ASSOCIATED PROTEINS 
<130> PF-0726 PCT 



<140> To Be Assigned 

<141> Herewith • 

<150> 60/145,232; 60/158,578; 60/165,192 ^ 
<151> 1999-07-21; 1999-10-07; 1999-11-12 

<160> 44 

<170> PERL Program . 

<210> 1 
<211> 411 

<212> PRT ' 
<213> Homo sapiens 

<220> 

<221> raisc_ feature 

<223> Incyte ID No: 209171CD1 

<400> 1 • 

Met Ala Gin Arg Gin Leu Leu.Asn Lys Lys Gly Phe Gly Glu 

1 5 10 

Val Leu Pro Arg Pro Pro Ser Leu lie Gin Asn Glu Cys Gly 

20 25 
Gly Glu Gin Ala Ser Glu Lys Asn Glu Cys lie Ser Glu Asp 

35 40 
Gly Asp Glu Asp Lys Glu Glu Arg Gin Glu Ser Arg Ala Ser 

50 55 
Trp His Ser Lys Thr Lys Asp Phe Gin Glu Ser Ser lie Lys 

65 70 
Leu Lys Glu Lys Leu Leu Leu Glu Glu Glu Pro Thr Thr Ser 

80 85 
Gly Gin Ser Ser Gin Gly lie Val Glu Glu Thr Ser Glu Glu 

95 100 
Asn Ser Val Pro Ala Ser Gin Ser Val Ala Ala Leu Thr Ser 

110 115 
Arg Ser Leu Val Leu Met Pro Glu Ser Ser Ala Glu Glu lie 

125 130 
Val Cys Pro Glu Thr Gin Leu Ser Ser Ser Glu Thr Phe Asp 

140 145 
Glu Arg Glu Val Ser Pro Gly Ser Arg Asp He Leu Asp Gly 

155 160 
Arg He He Met Ala Asp Lys Glu Val Gly Asn Lys Glu Asp 
170 175. 



Pro 

15 
Gin 

30 
Met 

45 
Asp 

60 
Ser 

75 
His 

90 
Gly 
105 
Lys 
120 
Thr 
135 
Leu 
150 
Val 
165 
Ala 
180 
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Glu Lys Glu Val Ala lie Ser Thr Phe Ser Ser Ser Ash Gin vai ; ; : 

185 , •' 190 : 1" - • ; 

Ser Cys Pro Leu Cys Asp Gin Cys Phe Pro Pro Thr Lys He Glu : ? v 

200 205 210 

Arg His Ala Met Tyr Cys Asn Gly Leu Met Glu Glu Asp Thr Val r 5 : 

' 215 • 220 - \ ; . 225 ■■; •• • 

Leu Thr Arg Arg Gin Lys Glu Ala Lys Thr Lys Ser Asp Ser Gly 
230 235 240 

Thr Ala Ala Gin Thr Ser Leu Asp He Asp Lys Asn Glu Lys Cys 

-245 ■-- - ,250 ~ '...255™ - ,. ... L:-.^.^. 

Tyr Leu Cys Lys Ser Leu Val Pro Phe Arg Glu Tyr Gin Cys His : 
260 265 ,270 

J val Asp Ser Cys Leu Gin Leu Ala Lys Ala Asp Gin. Gly Asp Gly _\. ' V' « . [ ■, 
• 275 ' ~ 280 ..' * • 285 < . 

Pro Glu Gly Ser Gly Arg Ala Cys Ser Thr Val Glu Gly Lys Trp 

290 "295 300 . 

Gin Gin Arg Leu Lys Asn Pro Lys Glu Lys Gly His Ser Glu Gly 
305 310 315 

:. - Arc Leu Leu Ser Phe Leu Glu Gin Ser Glu His Lys Thr Ser Asp; 

■ 320 325 330 - 

Ala Asp He Lys Ser Ser Glu Thr Gly Ala Phe Arg Val Pro Ser 
335 340 . 345 

Pro Gly Met Glu Glu Ala Gly Cys Ser Arg Glu Met Gin Ser Ser > 

. . 350 355 360 

Phe Thr Arg Arg Asp Leu Asn Glu Ser Pro Val Lys Ser Phe Val ■ ^ 

. 365 370 375 

Ser He Ser Glu Ala Thr Asp Cys Leu Val Asp. Phe Lys Lys Gin 

380 \ 385. 390 , 

" Val Thr Val Gin' Pro Gly Ser Arg Thr Arg Thr Lys Ala Gly Arg, /; 

395 400 405 

Gly Arg Arg Arg Lys, Phe 
" '. ■ ' 410 . ■' ,.\ / . . ; ■ , . , ... .'. . . 

<2io> 2 " '■" ' ' : " • ••".*' "-. . ' . . 

<211> 579 - : • . r ..* . • 

<2i2> prt : ; ' ; / : . '•' ' •'. ' 

<213> Homo sapiens . • - 

' <220> " ' ' ' . . • • -. • : ' : ' 

<22 1> misc_f eature 



15 
Val 

30 ' 
Phe 
45 
Thr 
60 
His 
75 
Gly 
90 
Gly 
105 
Thr 
120 
Ser 
135 
Leu 
150 
Phe 
165 



<223> Incyte ID 


No: 


945430CD1 






<400> 2 














Phe 


He Arg 


Met 


Phe 


Phe 


Arg 


Val 


Phe 


Leu 


His 


1 






5 










10 


Thr 


Ala 


Val 


Asp 


Phe 


Leu 


Pro 


Val 


Met 


Val His 






20 










■25 


Phe 


Lys 


Arg 


Tyr 


Met 


Gly 


Asn 


Thr 


Pro 


Gin Lys 








35 










40 


Gly 


Gin 


Cys 


Arg 


Gly 


Leu 


Pro 


Cys 


Val 


Ala Pro 








50 










55 . 


Val 


Glu 


Glu 


Ala 


Pro 


Arg 


<31y. 


He 


Ser 


Ala Arg 










65 










70 


Phe 


Pro 


Lys 


Trp 


Leu 


Asn 


Gly 


Ser 


Leu 


Leu Arg 








80 










85 


Lys 


Phe 


Glu 


Phe 


Gly 


Lys 


Asp 


Lys 


Tyr Asn His 








95 










100 


Met 


Ala 


Leu 


Leu 


His 


Gin 


Phe 


Arg 


Met 


Ala Lys 










110 










115 


Tyr 


Arg 


Ser 


Lys 


Phe 


Leu 


Gin 


Ser 


Asp 


Thr Tyr 






125 










130 


Ala 


Lys 


Asn 


Arg 


lie 


Val 


He 


Ser 


Glu 


Phe Gly 








140 










145 


Pro 


Asp 


Pro 


Cys 


Lys 


Asn 


Val 


Phe 


Glu 


Arg Phe 






155 










160 
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Glu 


Leu 


Pro 


Gly 


Lys 


Ala 


Ala 


Ala 


Met 


Thr 


Asp 








170 










1?5 




Asn 


Tyr 


Val 


Arg 


Tyr 


Lys 


Gly Asp Tyr Tyr 


Leu 










185 










190 




Asn 


Phe 


Met 


Asn 


Lys 


Val 


Asp 


He 


Glu 


Thr 


Leu 










200 










205 




Lys 


Val 


Asp 


Trp 


Ser 


Lys 


Phe 


He 


Ala 


Val 


Asn 








215 










220 




His 


Pro 


His 


Tyr 


Asp 


Pro 


Asp Gly Thr Ala 


Tyr 










230 










235 




Ser 


Phe 


Gly 


Pro 


Tyr 


Gly 


Phe 


Ser Tyr 


Lys 


Val 








245 










250 




Pro 


Glu 


Lys 


Val 


Asp 


Leu 


Gly Glu Thr lie 


His 








260 










265 




He 


Cys 


Ser 


He 


Ala 


Ser 


Thr 


Glu 


Lys 


Gly 


Lys 








275 










280 




His 


Ser 


Phe 


Gly Met 


Thr 


Arg 


Asn 


Tyr 


He 


He 










290 










295 




Pro 


Leu 


Lys 


Met 


Asn 


Leu 


Trp 


Lys 


He 


Ala 


Thr 










305 










310 




Gly 


Lys 


Ala 


Phe 


Ser 


Asp 


Gly 


lie 


Ser 


Trp 


Glu 








320 










325 




Thr 


Arg 


Phe 


His 


Val 


Val 


Glu 


Lys 


Arg 


Thr 


Gly 










335 










340 




Gly 


Arg 


Tyr 


Tyr 


Ser 


Lys 


Pro. 


Phe 


Val 


Thr 


Phe 










-350 










355 




Ala 


Phe 


Glu 


ASp 


Gin 


Gly 


Cys 


Val 


He 


lie 


Asp 










365 










370 




Asp 


Asn 


Gly 


Arg 


Thr 


Leu 


Glu 


Val 


Tyr 


Gin 


Leu 








380 










385 




Lys 


Ala 


Gly 


Glu 


Gly 


Leu 


Asp 


Gin 


Val 


His 


Asn 










395 










400 




Ser 


Phe 


Pro 


Arg 


Arg 


Phe 


Val 


Leu 


Pro 


Leu 


Asn 










410 










415 




Ala 


Pro 


Glu 


Gly 


Asp 


Asn 


Leu 


Ser 


Pro 


Leu 


Ser 










425 










430 




Ser 


Ala 


Val 


Lys 


Gin 


Ala 


Asp Gly Thr 


He 


Trp 










440 










445 




Asn 


Leu 


His 


Gin 


Glu 


Asp 


Leu 


Glu 


Lys 


Glu 


Gly 










455 










460 




Pro 


Gin 


He 


Tyr 


Tyr 


Asp 


Arg 


Phe 


Ser Gly 


Lys 










470 










475 




Phe 


Tyr 


Gly 


Cys 


Gly 


Phe 


Arg 


His 


Leu 


Val 


Gly 










485 










490 




Lys 


Val 


Asp 


Val 


Val 


Asn 


Lys 


Thr 


Leu 


Lys 


Val 










500 










505 




Gly 


Phe 


Tyr 


Pro 


Ser 


Glu 


Pro 


Val 


Phe 


Val 


Pro 






515 










520 




Asn 


Glu 


Glu 


Asp 


Gly 


Gly 


Val 


He 


Leu 


Ser 


val 










530 










535 




Asn 


Gin 


Asn 


Glu 


Ser 


Asn 


Phe 


Leu 


Leu 


Val 


Leu 










545 










550 




Phe 


Glu 


Glu 


Leu 


Gly 


Arg 


Ala 


Glu 


Val 


Pro 


Val 










560 










565 




Gly 


Phe 


His 


Gly Thr 


Phe 


He 


Pro 


He 







575 
<210> 3 
<211> 370 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1305513CD1 
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Asn Thr 


Asn 


Val 
180 










Cys Thr 


Glu 


Thr 
195 










Glu Lys 


Thr 


Glu 
210 










Gly Ala 


Thr 


Ala 
225 










Asn Met 


Gly Asn 














240 










He Arg 


Val 


Pro 
255 










Gly Val 


Gin 


Val 
270 




■ * ? 






Pro Ser 


Tyr 


Tyr 
285 










Phe lie 


Glu 


Gin 
300 










Ser Lys 


He 


Arg 
315 










Pro Gin 


cys 


Asn ... 
330 


- 








Gin Leu 


Leu 


Pro . 
345 










His Gin 


lie 


Asn 
360 










Leu Cys 


Ser 


Gin 
375 










Gin Asn 


Leu 


Arg 
390 




. ■ 


- _ . 




Ser Ala 


Ala 


Lys 














405 








Val Ser 


Leu 


Asn 
420 










Tyr Thr 


Ser 


Ala 
435 




f . . ■ 






Cys Ser 


His 


Glu 
450 










Gly lie 


Glu 


Phe 














465 










Lys Tyr 


His 


Phe 
480 








Asp Ser 


Leu 


He 
495 










Trp Arg 


Glu 


Asp 
510 










Ala Pro 


Gly Thr . 














525 










Val He 


Thr 


Pro 
540 










Asp Ala 


Lys 


Asn 
555 










Gin Met 


Pro 


Tyr ^ 














570 
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Mefila Asn Tyr Ser His Ala Ala Asp Asn lie Leu Gin Asn Leu 
v 5 10 * J " J 

Ser Pro Leu Thr Ala Phe Leu Lys Leu Thr Ser Leu Gly Phe lie 
20 25 JU 

lie Gly Val Ser Val Val Gly Asn Leu Leu He Ser lie Leu Leu , 

35 40 4 3 

val Lys Asp Lys Thr Leu His Arg Ala Pro Tyr Tyr Phe Leu Leu 

^ _ 50- ■ — — — - - ~— - * ~ — ■ — "' ' : — ■' 

Asp Leu cys cys Ser Asp He Leu Arg Ser Ala lie Cys Phe Pro 
65 70 '=> 

Phe Val Phe Asn Ser Val Lys Asn Gly Ser Thr Trp Thr Tyr Gly 

80 85 3 

Thr Leu Thr Cys Lys Val lie Ala Phe Leu Gly val Leu Ser Cys 
95 100 AU3 

Phe His Thr Ala Phe Met Leu Phe Cys lie Ser Val Thr Arg Tyr 

110 115 7 

Leu Ala He Ala His His Arg Phe Tyr Thr Lys Arg Leu Thr Phe 

125 130 
Trp Thr Cys Leu Ala Val lie Cys Met Val Trp Thr Leu Ser Val 
140 145 J r ■ 

Ala Met Ala Phe Pro Pro Val Leu Asp Val Gly Thr Tyr Ser Phe 
155 160 • LOD 

He Arg Glu Lys Asp Gin Cys Thr Phe Gin His Arg Ser Phe Arg 

170 175 
Ala Asn Asp Ser Leu Gly Phe Met Leu Leu Leu Ala Leu He Leu 

185 190 , . 

Leu Ala Thr. Gin Leu Val- Tyr Leu Lys Leu lie Phe Phe Val His 
200 205 Z1U 

Asp Arg Arg Lys Met Lys Pro Val Gin Phe Val Ala Ala Val Ser 

215 220 
Gin Asn Trp Thr Phe His Gly Pro Gly Ala Ser Gly Gin Ala Ala 

230 235 
Ala Asn Trp Leu Ala Gly Phe Gly Arg Gly Pro Thr Pro Pro Thr 
245 250 > 33 

Leu Leu Gly He Arg Gin Asn Ala Asn Thr Thr Gly Arg Arg Arg 
260 . 265 . * /u 

Leu Leu Val. Leu Asp Glu Phe. Lys . Met Glu Lys Arg He Ser Arg 

275 280 

Met Phe Tyr He Met Thr Phe Leu Phe Leu Thr Leu Trp Gly Pro 

• 290 295 

Tyr Leu Val Ala Cys Tyr Trp Arg Val Phe Ala Arg Gly Pro Val 



305 "~ 
Val Pro Gly Gly Phe Leu Thr Ala Ala Val Trp' Met Ser Phe Ala 



320 - — ^ 

Gin Ala Gly He Asn Pro Phe Val Cys He Phe Ser Asn Arg Glu 



325 330 



335 340 345 

Leu Arg Arg Cys Phe Ser Thr Thr Leu Leu Tyr Cys Arg Lys Ser 

Arq Leu Pro Arg Glu Pro Tyr Cys Val lie 
365 , 370 

<210> 4 

<211> 267 , 
<212> PRT •' 
<213> Homo sapiens : , ' , 

• ' . ; ' ••• . ' ■ . - • . r: 

<220> • . ' - * * : 

<221> misc_featur"e 

<223> Incyte ID No: 1876283CD1 - / 

Me?°ila Pro Trp Ala Leu Leu Ser Pro Gly Val Leu-Val Arg Thr 

n 5 10 • - lb 

Gly His Thr Val Leu Thr Trp Gly He Thr Leu Val Leu Phe Leu 

" ^ ~ : 4/36 " " : ~ 



WO 01/07612 



PCT/US00/20035 



20 25 . 30 

His Asp Thr Glu Leu Arg Gin Trp Glu Glu Gin Gly Glu Leu Leu 
35 40 45 

Leu Pro Leu Thr Phe Leu Leu Leu Val Leu Gly Ser Leu Leu Leu 
50 55 60 

Tyr Leu Ala Val Ser Leu Met Asp Pro Gly Tyr Val Asn Val Gin 
65 70 75 

Pro Gin Pro Gin Glu Glu Leu Lys Glu Glu Gin Thr Ala Met Val 
80 85 90 

Pro Pro Ala lie Pro Leu Arg Arg Cys Arg Tyr Cys Leu Val Leu 
95 100 105 

Gin Pro Leu Arg Ala Arg His Cys Arg Glu Cys Arg Arg Cys Val 

110 115 120 . *' 

Arg Arg Tyr Asp His His Cys Pro Trp Met Glu Asn Cys Val Gly 
* 125 130 135 

Glu Arg Asn His Pro Leu Phe Val Val Tyr Leu Ala Leu Gin Leu 
140 145 150 

Val Val Leu Leu Trp Gly Leu Tyr Leu Aia Trp Ser Gly Leu Arg 
155 160 165 

Phe Phe Gin Pro Trp Gly Leu Trp Leu Arg Ser Ser Gly Leu Leu ' 

170 175 180 

Phe Ala Thr Phe Leu Leu Leu Ser Leu Phe Ser Leu Val Ala Ser 

185 190 195 1 ' • \ 

Leu Leu Leu Val Ser His Leu Tyr Leu Val Ala Ser Asn Thr Thr 
200 205 210 

Thr Trp Glu Phe lie Ser Ser His Arg lie Ala Tyr Leu Arg Gin • 

215 220 225 ' " 

Arg Pro Ser Asn Pro Phe Asp Arg Gly Leu Thr Arg Asn Leu Ala. r <. 



230 235 240 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2470285CD1 



<400> 5 * 

Met Pro Gly Pro Leu Gly Leu Leu Cys Phe Leu Ala Leu Gly Leu 
1 " 5 10 15 

Leu Gly Ser Ala Gly Pro Ser Gly Ala Ala Pro Pro Leu Cys Aia 

20 25 30 

Ala Pro Cys Ser Cys Asp Gly Asp Arg Arg Val Asp Cys Ser Gly 

35 40 45 

Lys Gly Leu Thr Ala Val Pro Glu Gly Leu Ser Ala Phe Thr Gin 

50 55, 60 

Ala Leu Asp lie Ser Met Asn Asn lie Thr Gin Leu Pro Glu Asp 

65 ( 70 75 

Ala Phe Lys Asn Phe Pro Phe Leu Glu Glu Leu Gin Leu Ala Gly 

80 85 90 

Asn Asp Leu Ser Phe lie His Pro Lys Ala Leu Ser Gly Leu Lys 

95 100 105 

Glu Leu Lys Val Leu Thr Leu Gin Asn Asn Gin Leu Lys Thr Val 
110 115 120 

Pro Ser Glu Ala He Arg Gly Leu Ser Ala Leu Gin Ser Leu Arg 
125 130 135 

Leu Asp Ala Asn His He Thr Ser Val Pro Glu Asp Ser Phe Glu 
140 145 150 

Gly Leu Val Gin Leu Arg His Leu Trp Leu Asp Asp Asn Ser Leu 



5/36 



His Phe Phe Cys Gly Trp Pro Ser Gly Ser Trp Glu Thr Leu Trp ^ : F ■ k 

245 250 " 255 

Ala Glu Glu Glu Glu Glu Gly Ser Ser Pro Ala Val 7 - 

260 265 

<210> 5 

<211> 951 ' 
<212> PRT 

<213> Homo sapiens 
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■ 155 • ^ ' • 160 ' * I 65 

Thr Glu Val Pro Val His Pro Leu; Ser Asn Leu Pro Thr Leu Gin 
. - 170 . 175 : ; ' i °° 

Ala Leu Thr Leu Ala Leu Asn Lys lie Ser Ser He Pro Asp Phe 

185 190 . . ■ 195 - 

Ala Phe Thr Asn Leu Ser Ser Leu Val Val Leu. His Leu His Asn 
200 205 210 

Asn Lys lie Arg Ser Leu Ser Gin His Cys Phe Asp Gly Leu Asp 
215 220 " D 

Asn Leu Glu Thr Xeu^Asp Leu ^n;Tyr Asn Asn Leu Gly Glu Phe 
230 235 240 

Pro Gin Ala He Lys Ala Leu Pro Ser Leu Lys Glu Leu Gly Phe 

- 245 - - 250 .•' - 255 

His Ser Asn Ser He Ser Val He Pro Asp Gly Ala Phe Asp Gly. 

260 265 ■ 270 

Asn Pro Leu Leu Arg Thr lie His Leu Tyr Asp Asn Pro Leu Ser 
275 280 285 

Phe Val Gly Asn Ser Ala Phe His Asn Leu Ser Asp Leu His Ser 
290 295 300 

Leu Val He Arg Gly Ala Ser Met Val Gin Gin Phe Pro Asn Leu 
305 310 315 

Thr Gly Thr Val His Leu Glu Ser Leu Thr Leu Thr Gly Thr Lys 
320 325 330 

He ser Ser lie Pro Asn Asn Leu Cys Gin Glu Gin Lys Met Leu 
335 340 . 345 

Arg Thr Leu Asp Leu Ser Tyr Asn Asn lie Arg Asp Leu Pro Ser 
350 355 360 

Phe Asn, Gly Cys His Ala Leu Glu Glu lie: Ser; Leu Gin Arg Asn 
365 370 , 375 

Gin lie Tyr . Gin lie Lys Glu Gly Thr Phe Gin Gly Leu He Ser 
380 385 J f u 

Leu Arg He Leu Asp Leu Ser Arg Asn Leu He His Glu He His.. 

395 400 . 405 

Ser Arg. Ala Phe Ala Thr Leu Gly Pro lie Thr Asn Leu Asp Val 
• 410 415 4 f u 

Ser Phe Asn Glu Leu Thr Ser Phe Pro Thr Glu Gly Leu Asn Gly 
425 430 < 435 

Leu Asn Gin Leu Lys Leu- Val Gly Asn Phe Lys Leu Lys Glu Ala 
. 440 445 45U 

Leu Ala Ala Lys Asp Phe Val Asn Leu Arg Ser Leu Ser val Pro 
, 455 460 :-. 465 

Tyr Ala Tyr Gin Cys Cys Ala Phe Trp Gly Cys Asp Ser Tyr Ala 

Asn Leu Asn Thr Glu Asp Asn Ser Leu Gin Asp His Ser Val Ala 

485 490 495 

Gin Glu Lys Gly Thr Ala Asp Ala Ala Asn Val Thr Ser Thr Leu 

500 505 510 

Glu Asn Glu Glu His Ser Gin He He lie His Cys Thr Pro Ser 

515 520 525 

Thr Gly Ala Phe Lys Pro Cys Glu Tyr Leu Leu Gly Ser Trp Met 

530 535 , . 540 

lie Arg Leu Thr Val Trp Phe lie Phe Leu Val Ala Leu Phe Phe 
, . - 545 . 550 . 555 

Asn Leu Leu Val lie Leu. Thr" Thr Phe Ala Ser Cys Thr Ser Leu 

560 565 - 5 

Pro Ser Ser Lys Leu Phe lie Gly Leu lie Ser Val Ser Asn Leu 

575 580 585 

Phe Met Gly lie Tyr Thr Gly He Leu Thr phe Leu Asp Ala Val 

590 595 , POO 

Ser Trp Gly Arg Phe Ala' Glu Phe Gly He Trp Trp Glu Thr Gly 

605 610 ... ' 615 

Ser Gly Cys Lys Val Ala Gly Phe Leu Ala Val Phe Ser Ser G u 

620 625 ^ 630 

Ser Ala lie Phe Leu Leu Met Leu Ala Thr Vai Glu Arg Ser Leu 
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635 . 640 '* . 645 ■ '• 

Ser Ala Lys Asp He Met Lys Asn Gly Lys Ser Asn His Leu Lys - 

650 655 660 

Gin Phe Arg Val Ala Ala Leu Leu Ala Phe Leu Gly Ala Thr Val 

665 670 675 

Ala Gly Cys Phe Pro Leu Phe His Arg Gly Glu Tyr Ser Ala Ser 

680 685 690 

Pro Leu Cys Leu Pro Phe Pro Thr Gly Glu Thr Pro Ser Leu Gly 

695 700 • 70S 

Phe Thr Val Thr Leu Val Leu Leu Asn Ser Leu Ala Phe Leu Leu 

710 715 720 

Met Ala Val He Tyr Thr Lys Leu Tyr Cys Asn Leu Glu Lys Glu 

725 730 735 

Asp Leu Ser Glu Asn Ser Gin Ser Ser Met lie Lys His Val Ala n 

740 745 750 

Trp Leu He Phe Thr Asn Cys He Phe Phe Cys Pro Val Ala Phe , 

755 760 765 

Phe Ser Phe Ala Pro Leu He Thr Ala lie Ser He Ser Pro Glu 

770 775 780 

He Met Lys Ser Val Thr Leu He Phe Phe Pro Leu Pro Ala Cys 

785 790 795 ' 

Leu Asn Pro Val Leu Tyr Val Phe Phe Asn Pro Lys Phe Lys Glu 

800 805 810 * 

Asp Trp Lys Leu Leu Lys Arg Arg Val Thr Lys Lys Ser Gly Ser 

815 820 825 

Val Ser Val Ser He Ser Ser Gin Gly Gly Cys Leu Glu Gin Asp ^ 

830 835 840 , 

Phe Tyr Tyr Asp Cys Gly Met Tyr Ser His Leu Gin Gly Asn Leu " . ^ ^ ? ! 

845 850 855 *T ' 

Thr Val Cys Asp Cys Cys Glu Ser Phe Leu Leu Thr Lys Pro Val M 

860 865 870 * )*/ " 

Ser Cys Lys His Leu He Lys Ser His Ser Cys Pro Ala Leu Ala * 4# ^ 

875 880 885 " 3 

Val Ala Ser Cys Gin Arg Pro Glu Gly Tyr Trp Ser Asp Cys Gly 

890 895 " 900 ; ^ ^ 

Thr Gin Ser Ala His Ser Asp Tyr Ala Asp Glu Glu Asp Ser Phe - \, :p J 

905 910 915 / 

Val Ser Asp Ser Ser Asp Gin Val Gin Ala Cys Gly Arg Ala Cys 4v| 

920 925 930 /'V " 

Phe Tyr Gin Ser Arg Gly Phe Pro Leu Val Arg Tyr Ala Tyr Asn , 

935 940 ~ 945 

Leu Pro Arg Val Lys Asp \ - '. - 

950 ' ; r ■ ' ' : : • 

<210> 6 . * . 

<211> 413 : . 

<212> PRT 

<213> Homo sapiens 

<220> * ■* " 1 . : 



<221> misc_ 


feature 






















<223> Incyte ID 


No: 


2925789CD1 
















<400> 6 


























Met Gly Thr 


Phe 


Cys 


Ser 


Val 


He 


Lys 


Phe 


Glu 


Asn 


Leu 


Gin 


Glu 


1 




5 










10 










15 


Leu Lys Arg 


Leu 


Cys 


His 


Trp 


Gly 


Pro 


He 


He 


Ala 


Leu 


Gly 


Val 






20 










25 










30 


lie Ala He 


Cys 


Ser 


Thr 


Met 


Ala 


Met 


He 


Asp 


Ser 


Val 


Leu 


Trp 






35 










40 










,45 


Tyr Trp Pro 


Leu 


His 


Thr 


Thr 


Gly Gly 


"Ser 


Val 


Asn 


Phe 


He 


Met 






:so 










.55 










.60 


Leu He Asn 


Trp 


Thr 


Val 


Met. 


lie 


Leu 


Tyr 


Asn 


Tyr 


Phe 


Asn 


Ala 






165 










70 










75 


Met Phe Val 


Gly 


Pro 


Gly 


Phe. 


Val 


Pro, 


Leu 


Gly 


Trp. 


Lys 


Pro 


Glu 
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«;W : bin Asp Thr Met Tyr Uu Gin Tyj Cys Lys Val Cys Gin 
Ala Tyr Lys Ala Pro Arg Ser His His Cys Arg Lys Cys Asn Arg 



s x « V ui c e cvs Pro Trp lie Asn Asn Cys 
cys Val Met Lys Met Asp His His cys f 135 



125 . 

Cys Gly Tyr Gin Asn His Ala Ser Phe Thr Leu Phe Leu Leu Leu 



XiaW^GlrSsne-His^ 



145 



Met Tyr Thr Gin Leu Tyr His Arg Leu Ser Phe_ Gly Trp Asn Thr 



160 



170 — * 1 

Va"l Lys lie " As p Met Ser Ala Ala Arg Arg Asp Pro Leu Pro lie 



175 



185 

Val Pro Phe: Gly Leu Ala Ala Phe Ala Thr , Thr Leu Phe Ala Leu 

200 ***** 

Gly.. Leu Ala Leu Gly Thr Thr lie Ala Val Gly Met Leu Phe Phe 

He Gin Met Lys Ue lie Leu Arg Asn Lys Thr Ser lie Glu ser 

230 2 j d r 

Trp He Glu Glu Lys Ala Lys Asp Arg lie Gin Tyr Tyr Gin 

Asp Glu Val Phe val Phe Pro Tyr Asp Met Gly Ser Arg Trp Arg 

- - ■ ■ 260 2bb 

Asn Phe Lys Gin val Phe Thr Trp Ser Gly val Pro Glu Gly Asp 
Gly Leu Glu Trp Pro Val Arg Glu Gly. Cys His Gin Tyr Ser Leu ; 
Thr;lle Glu Gin Leu Lys Gin Lys Ala Asp Lys Arg Val Arg Ser ... 
Val Arg Tyr Lys Val He Glu Asp Tyr Ser -Gly , Ala Cys. Cys. Pro , 
Leu Asn Lys Gly ne Lys Thr Phe Phe Thr Ser Pro Cys Thr Glu 
Glu Pro Arg; lie Gin Leu Gin -Lys Gly.GluPhe He. Leu Ala Thr 
Arg Gly-Leu. Arg Tyr- Trp Leu Tyr Gly 'Asp; Lys lie Leu Asp "Asp 

Ser Phe lie Glu 'III Val Ser Arg lie Arg Gly Trp Phe Pro Arg 

380 

Lys Cys Val Glu Lys Cys Pro Cys Asp Ala Glu Thr Asp Gin Ala 

Pro Glu Gly Glu Lys Lys Asn Arg T 

: - ' 410 

<210> 7 ■ • • ! • ' \ ; .. -'. 

<211> 144 * 
<212> PRT • ■ - : 

<213> Homo sapiens 

<220> . ,.. • 

<221> mis.c_feature 

<223> Incyte ID No: 3099990CD1 : 



190 



Me^Lys Gly Lys Ala Arg Lys Leu Phe Tyr Lys Ala lie Val Arg 

Gly Glu Glu Thr Leu Arg Val Gly Asp Cys Ala Val Phe Leu Ser- 

20 2d 
Ala Gly Arg Pro Asn Leu Pro Tyr lie Gly Arg He Glu Ser Met 

35 4U 

Trp Glu Ser. Trp Gly Ser Asn Met Val Val Lys Val Lys Trp Phe 

sr 50 

Tyr His Pro Glu Glu Thr Lys Leu Gly Lys Arg Gin Cys Asp Gly 



-8/36- 



BNSOOCID- <WO 0107612A2J_> 



WO 01/07612 



PCT/US00/20035, 



• • 65 - 70 75 

Lys Asn Ala Leu Tyr Gin Ser Cys His Glu Asp Glu Asn Asp Val 

80 85 90 

Gin Thr lie Ser His Lys Cys Gin Val Val Ala Arg Glu Gin Tyr 

95 100 105 

Glu Gin Met Ala Arg Ser Arg Lys Cys Gin Asp Arg Gin Asp Leu 

110 115 120 

Tyr Tyr Leu Ala Gly Thr Tyr Asp Pro Thr Thr Gly Arg Leu Val 

125 130 ' 135 

Thr Ala Asp Gly Val Pro lie Leu Cys 

140 \ - _ ; 

<210> 8 

<211> 174 •* < 
<212> PRT 

<213> Homo sapiens 

<220> ■. ; 

<221> mis c_ feature ■ 

<223> Incyte ID No: 103561CD1 

<400> 8 ' ' 

Met Ala Pro Pro Val Arg Leu Glu Arg Pro Phe Pro Ser Arg Arg 
1 : 5 10 15 

Phe Pro Gly Leu Leu Leu Ala Ala Leu Val Leu Leu Leu Ser Ser 
20 2.5 30 

Phe Ser Asp Gin Cys Asn Val Pro Glu Trp Leu Pro Phe Ala Arg 
35 40 45 

Pro Thr Asn Leu Thr Asp Asp Phe Glu Phe Pro He Gly Thr Tyr 
50 55 60 

Leu Asn Tyr Glu Cys Arg Pro Gly Tyr Ser Gly Arg Pro Phe Ser 
,65 70 75 

He He Cys Leu Lys Asn Ser Val Trp Thr Ser Ala Lys Asp Lys 
80 85 90 

Cys Lys Arg Lys Ser Cys Arg Asn Pro Pro Asp Pro Val Asn Gly 
95 100 105 

Met Ala His Val He Lys Asp He Gin Phe Gly Ser Gin He Lys 
110 115 120 

Tyr Ser Cys Pro Lys Gly Tyr Arg Leu He Gly Ser Ser Ser Ala 
125 130 135 

Thr Cys He He Ser Gly Asn Thr Val He Trp Asp Asn Lys Thr 
140 145 150 

Pro Val Cys Asp Ser Glu Leu Lys Tyr Ala Phe Leu Phe Leu Leu 
155 160 '* 165 

Pro lie His Ser Asn Phe Ser Leu Glu 
170 

<210> 9 
<211> 449 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<2 2 3> Incyte ID No: 288709CD1 

<400> 9 

Met Gin Leu Asp Trp Asn Gin Ala Gin Lys Ser Gly Asp Pro Gly 
1 5 10 15 

Pro Ser Val Val Gly Leu Val Ser He Pro Gly Met Gly Lys Leu 

20 25 30 

Leu Ala Glu Ala Pro Leu Val Leu Glu Pro Glu Lys Gin Met Leu 

35 40 45 

Leu His Glu Thr His Gin Gly Leu Leu Gin Asp Gly Ser Pro He 

50 .55 60 

Leu Leu Ser Asp Val lie Ser Ala Phe Leu Ser Asn Asn Asp Thr 
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Gin Asn Leu S er Ser Pro Val Thr Phe Thr Phe Ser His Arg Ser - 

"80 oD . • Jyj 

Val lie Pro Arg Gin Lys Val Leu Cys Val Phe Trp Glu His Gly 

Gin Asn Gly Cys g" His Tr^ Ala Thr Thr Gly Cys Ser Thr lie 

no 115 . . i 

Gly Thr Arg Asp Thr Ser- Thr lie Cys Arg Cys Thr Hxs Leu Ser 

Ser Phe Al~a Val leu Met Ala His Tyr Asp Val Gin Glu Glu Asp - 

140 145 AJ 

Pro val Leu Thr Val He Thr Tyr Met Gly Leu Ser Val Ser Leu 

155 160 
Leu Cys Leu Leu Leu Ala Ala Leu Thr Phe Leu- Leu Cys Lys Ala 

170 175 i. 

lie Gin Asn Thr Ser Thr Ser Leu His Leu Gin Leu Ser Leu Cys 

185 190 A " 

Leu Phe Leu Ala His Leu Leu Phe Leu Val Gly lie Asp Arg Thr 

200 205 . 1~ 

Glu Pro Lys Val Leu Cys Ser He He Ala Gly Ala Leu His Tyr 

215 220 . 

Leu Tyr Leu Ala Ala Phe Thr Trp Met Leu Leu Glu Gly Val His 

230 235 1 

Leu Phe Leu Thr Ala Arg Asn Leu Thr Val Val Asn Tyr Ser Ser 

He Asn Arg Leu Met Lys Trp. He Met Phe Pro Val Gly Tyr Gly 

260 265 . 

Val Pro Ala Val Thr Val Ala He Ser Ala Ala Ser Trp Pro His. 

Leu Tyr Gly Thr Ala Asp Arg- Cys Trp Leu His Leu Asp Gin Gly 

290 2.95. J - 

Phe Met Trp Ser Phe Leu Gly. Pro Val Cys Ala He Phe Ser Ala 

305 310 " J 

Asn Leu Val Leu Phe He Leu Val Phe. Trp He Leu Lys Arg Lys 

320 325 . . . ■ J 

Leu Ser Ser Leu Asn Ser Glu Val Ser Thr He Gin Asn Thr Arg 

335 340 J *- > 

Met Leu Ala Phe Lys Ala Thr Ala Gin Leu Phe . He Leu Gly Cys 

350 355 -T 

Thr Trp Cys Leu Gly Leu Leu Gin Val Gly Pro Ala Ala Gin Val 

Met Ala Tyr Leu Phe Thr lie He Asn Ser Leu Gin Gly Phe Phe 

380 385 " v 

He Phe Leu Val Tyr Cys Leu Leu Ser Gin Gin Val Gin Lys Gin 

395 400 
Tyr Gin Lys Trp Phe Arg Glu lie Val Lys Ser Lys Ser Glu Ser 

410 415 * 

Glu Thr Tyr Thr Leu Ser Ser Lys Met Gly Pro Asp Ser Lys Pro 

425 4 ^0 
Ser Glu Gly Asp Val Phe Pro Gly Gin Val Lys Arg Lys Tyr 

440 ... 445 . " \ ■ 

<210> 10 

<211> 126 - : * - . . ^ ... • 

<212> PRT 

<213> Homo sapiens ; : : : ... 1 

.* <220> . . ... -.:.-<: - '•■ - * . 

<221> ' misc_f eature 

<223> incyte ID No: 959893CD1 

Se2 0 Slu 0 Ser Phe Leu Gly Gly Val Leu Leu He Leu Trp Leu Gin 
Vai Asp Trp Val Lys Ser Gin Lys He Glu Gin Asn Ser Glu Ala 

" " • . ; 10/36 --/-. 
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20 








25 




Leu Asn 


He 


Gin 


Glu 


Gly 


Lys 


Thr Ala 


Thr 


Leu 








35 








40 




Thr Asn 


Tyr 


Ser 


Pro 


Ala 


Tyr 


Leu Gin 


Trp 


Tyr 








50 








55 




Gly Arg 


Gly 


Pro 


Val 


Phe 


Leu 


Leu Leu 


He 


Arg 








65 








. 70 




Glu Lys 


Arg 


Lys Glu 


Arg 


Leu 


Lys Val 


Thr 


Phe 








80 








85 




Lys Gin 


Ser 


Leu 


Phe 


His 


He 


Thr Ala 


Ser 


Gin 








95 








100 




Ala Asn 


Tyr 


Leu 


Cys 


Ala 


Leu 


Gly Gly Arg Gly 








110 








115 




Leu Gly Gin 


Ala 


Leu 


Ser 
















125 













<210> 11 
<211> 273 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_ feature 

<223> Incyte ID No: 1414179CD1 

<400> 11 



Met Gly Arg 


Ser 


Arg 


Ser 


Arg 


Ser 


Ser 


Ser 


Arg 


1 








* 5 










10 




Lys 


Ser 


Ser 


Lys 


His 


Asn 


Lys 


Lys 


Arg 


Ser 


Arg 










20 








25 




Ser Arg 


Asp 


Lys 


Glu 


Arg 


Val 


Arg 


Lys 


Arg 


Ser 










35 










40 




Ser 


Lys 


Arg 


Asn 


Arg 


Arg 


Arg 


Glu 


Ser 


Arg 


Ser 










50 










55 




Thr 


Asn 


Thr 


Ala 


Val 


Ser 


Arg 


Arg 


Glu 


Arg 


Asp 










65 










70 




Ser 


Ser 


Pro 


Pro 


Asp 


Arg 


He 


Asp 


He 


Phe 


Gly 










80 










85 




Lys 


Arg 


Ser 


Ser 


Leu 


Asp 


Glu 


Lys 


Gin 


Lys 


Arg 










95 










.100 




Lys 


Lys 


Ala 


Glu 


Phe 


Glu 


Arg 


Gin 


Arg. Lys 


He 










110 










115 




He 


Glu 


Glu 


Lys 


Leu 


He 


Glu 


Glu 


Glu 


Thr 


Ala 










125 










130 




Glu 


Leu 


Val 


Ala 


Lys 


Arg 


Val 


Glu 


Glu 


Glu 


Leu 










140 










145 




Asp 


Glu 


lie 


Glu 


Arg 


Glu 


Val 


Leu 


Arg 


Arg 


Val 










155 










160 




Arg 


He 


Met 


Glu 


Lys 


Gin 


Leu 


Leu 


Glu 


Glu 


Leu 










170 










175 




Gin 


Ala 


Glu 


Leu 


Ala 


Ala 


Gin 


Lys 


Ala 


Arg 


Glu 










185 










190 




Ala 


Lys 


Arg 


Glu 


Glu 


Leu 


Glu 


Arg 


He 


Leu 


Glu 










200 










205 




Lys. 


He 


Ala 


Glu 


Ala 


Gin 


Ala 


Lys 


Leu 


Ala 


Glu 










215 










220 




He 


Val 


Glu 


Glu 


Gin 


Arg 


Lys 


He 


His 


Glu 


Glu 










230 










235 




Glu 


Gin 


Glu 


Arg 


Gin 


Arg 


Gin 


Gin 


Lys 


Glu 


Glu 










245 










250 




Leu 


Gly 


Lys 


Gly 


Lys 


Ser 


Arg 


Pro 


Lys 


Leu 


Ser 










260 










265 




Thr 


Gin 


Asp 



















<210> 12 





30 




Thr Cys Asn 


Tyr 






45 




Arg Gin Asp 


Pro 






60 




Glu Asn Glu 


Lys 






75 7 




Asp Thr Thr 


Leu 






-90 




Pro Ala Asp 


Ser 






105 - 




Thr Asn Ser 


Pro 




- < - ■ * , 


120 - 




- " ■ ■ 

Ser Lys His 


Thr 






15 


'- - . * '-4 


Ser Arg Ser 


Arg . 






30 




Lys Ser Arg 


Glu 


* > y -A 




45 


■ 


Arg Ser Arg 


Ser \ 






60 ~ 




Arg Glu Arg 


Ala 






75 " ' • 




Arg Thr Val 


Ser 


* \ 




90 




Glu Glu Glu 


Glu 






105 


Arg Gin Gin 


Glu 






120 - : 




Arg Arg Val 


Glu 






135 ' ' ? 




Glu Lys Arg 


Lys 






150 




Glu Glu Ala 


Lys 






165 




Glu Arg Gin 


Arg 






180 ' ~ 




Glu Glu Glu 


Arg 






195 




Glu Asn Asn 


Arg 






210 




Glu Gin Leu 


Arg 






225 




Arg Met Lys 


Leu 






240 




Gin Lys lie 


He 






255 • 




Phe Ser Leu 


Lys 






270 
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<211> 140' - ' ", ; . . ' ■ . :. ■ :.,/■•'•".>.*-•• •-■*<',' r . ■ . - . 

<212>. PRT W". ' . .. ■ ' 

<213> Homo sapiens. • ^ . . v ; 

<220> V ; : . ' ,'»..•• : : " ■ ' 

<221> misc_ feature 

<223> Incyte ID No: 2197211CD1. 

Met Q GlJ 2 Lys Met Leu Glu Cys Ala Phe lie VaiwTrp Leu^Gln 
X 5 10 15 

Leu Gly Trp Leu Ser Gly Glu Asp Gin Val Thr Gin Ser Pro Glu 

20 25 ^ 30 

Ala Leu Arg Leu Gin Glu Gly Glu Ser Ser Ser Leu Asn Cys Ser 

35 40 45 

Tyr Thr Val Ser Gly Leu Arg Gly Leu Phe Trp Tyr Arg Gin Asp 

50 55 60 

Pro Gly Lys Gly Pro Glu Phe Leu Phe Thr Leu Tyr Ser Ala Gly. 

65 70 ^ 75 

Glu Glu Lys Glu Lys Glu Arg Leu Lys Ala Thr Leu Thr Lys Lys 

80 85 JO 

Glu Ser Phe Leu His lie Thr Ala Pro Lys Pro Glu Asp Ser Ala 

95 100 lo;> 

Ser Tyr Leu Cys Ala Val Gin Gly Gly He Gly Asn Val Leu His 

HO 115 
Cys Gly Ser Gly Thr Gin Val Val Val Leu Pro His He Arg Asp 
125 130 135 

Pro Gly Pro Ala val 

- 140 ' ... -\ ; \ 

<210> 13 
<211> 479. 

<212> PRT ; ' 

<213> Homo sapiens, T . , 

<220> , ; , , 

<221> misc_f eature 

<223> Incyte ID No: 2263653CD1 



<400> 13 
Met Ala 

i 


Val 


Leu Gly Val Gin 
5 


Leu 


Val 


Val 
10 


Thr 


Leu 


Leu 


Thr 


Ala 

15 




Thr 


Leu 


Met 


His 


Arg" 
20 


Leu 


Ala 


Pro 


His 


Cys 
25 


Ser 


Phe 


Ala 


Arg 


Trp 
30 




Leu 


Leu 


Cys 


Asn 


Gly 
35 


Ser 


Leu 


Phe 


Arg 


Tyr 
40 


Lys 


His 


Pro 


Ser 


Glu 
45 




Glu 


Glu 


Leu 


Arg 


Ala 


Leu 


Ala 


Gly 


Lys 


Pro 


Arg 


Pro 


Arg 


Gly 


Arg 
60 






50 










55- 








Leu 




Lys 


Glu Arg Trp Ala Asn Gly 
65 


Leu 


Ser 


Glu 
70 


Glu 


Lys 


Pro 


Ser 
75 




Val 
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Phe Leu Ser His Leu Ala Val Val Asp He Ala Tyr Ala Cys Asn 
"■: " ' "'■ .■ ' 65 : • ' - , *■ • ■. --70- ■ - 75 . . . ; ■ / . 

_l-jrhr Val Pro. Arg Metleu Val Asn Leu Leu His Pro Ala Lys Pro : - 1 

" r ~ . "'" ' - " 80 •: ^ ■"■ 85 90 , "\ ~ 

He Ser Phe Ala Gly Arg Met Met Gin Thr Phe Leu Phe Ser Thr 

. .95 100 \ ■ ■ 105 - 

" phe Ala Val Thr Glu. Cys Leu Leu Leu Val Val Met Ser Tyr Asp - ; V ^ 

; "no" * ..- '• *' lis ' 120 . 

Leu Tyr Val Ala lie Cys His' Pro Leu Arg Tyr Leu Ala lie Met , - 

125 130 135 

Thr Trp Arg Val Cys lie Thr Leu Ala Val Thr Ser Trp Thr Thr 
140 r 145 150 

Gly Val Leu Leu Ser Leu lie His Leu Val Leu Leu Leu Pro Leu 

155 160 165 , 

Pro Phe Cys Arg Pro Gin Lys He Tyr His Phe Phe Cys Glu lie \ r /' : 

/ : 170 175 180 . , • . 

Leu Ala Val Leu Lys Leu Ala Cys Ala Asp Thr His lie Asn Glu 
185 190 19 f 

• Asn Met Val Leu Ala Gly Ala lie Ser Gly Leu Val. Gly Pro Leu ;. 

200 . : ' ' 205 210 . 

Ser Thr lie Val Val Ser Tyr Met Cys: : lie Leu Cys Ala He Leu - 
: . " 215 220 225 '; 7 

Gin lie Gin Ser Arg Glu Val Gin Arg Lys Ala Phe Cys Thr Cys 
230 ■ 235 240 

Phe Ser His Leu Cys. Val lie Gly Leu Phe Tyr Gly; Thr Ala lie ■ ■ , 

245 250 255 " 

He M et Tyr Val Gly Pro Arg Tyr Gly Asn Pro Lys Glu Gin Lys • 
, . 260 265 270 

Lys Tyr Leu Leu Leu Phe His Ser Leu Phe Asn Pro Met Leu Asn, 
275 280 -•" 285 

Pro Leu lie Cys Ser Leu Arg Asn Ser Glu Val Lys Asn Thr Leu 
; 290 295 300 

Lys Arg Val Leu Gly Val Glu Arg Ala Leu ' v 

. 305 : ; ' " .;- ; 310... '. , . .7 . - • 

<2io> 20 • : • . • . ; ■ '. ; .- '. ■ . • .\ •.' •• ••' ■" : ' . . • -. ;r 

;■ <211> 438 *• - V s - " > 

■•\.,' <2i2> prt ,* .v ; > ': \ v.- ' • . '■ • " '•• " .;-/". •* : " v Vv 

<213> Homo sapiens • " : . " . . : \ r / : ~ 

: ~ <220> ' - • , -\ . , ; Z : ' ; " r - '■■ \ , .\ -'■ 

<221> roisc_feature . 
' . <223> Incyte ID No: 2436258CD1 

/ Met°Glu°val Gly Gly Asp Thr Ala Ala Pro Ala Pro Gly Gly Ala 
1 V 5 10 I 5 

Glu Asp Leu Glu Asp Thr Gin Phe Pro Ser Glu Glu Ala Arg Glu * ■ 

20 25 < - 30 

Gly Gly Gly Val His Ala Val Pro Pro Asp Pro Glu Asp Glu Gly 
35 40 45 

Leu Glu Glu Thr Glu Asp His Lys Leu Val Phe Leu ; Gin Gin Gly • 

so : 55 60 : • ; 

Pro Leu Leu Leu Val Ala • Met Ser Arg Thr Ser Gin Ser Ala Ala / . , . v ; 

65 70 - \ 75 

Gin Leu Arg Gly Glu Leu Leu Ala Val His Ala Gin He Val, Ser. , y. : 

80 : 85 - 90 

Thr Leu Thr Arg Ala Ser Val Ala Arg He Phe Ala His Lys Gin , . 

, 18/36 ■' " ' .-■ "•' ■ ' 
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95 










100 










105 


Asn 


Tyr 


Asp 


Leu 


Arg 




Leu 


Leu 


Ala 




Ser 


Glu 


Arg 


Thr 


Leu 










110 










115 










120 




Arg 


Leu 


Leu 


Asp 


Ser 


Met 


Glu 


Gin 


Asp 


Pro Gly Ala 


Leu 


Leu 










125 










130 










135 


Leu 


Gly Ala 


Val 


Arc 


Cvs 


Val 


Pro 


Leu 


Ala 


Ara 


Pro 


Leu 


Arg 


Asp 
* ***** 










140 










145 










150 


Ala 


Leu Gly Ala 


Leu 


Leu 


Arg 


Arg 


Cys 


Thr 


Ala 


Pro Gly 


Leu 


Ala 










155 










160 










165 


Leu ' 




Val 


Leu 


Ala 


Val 


Gly Gly Arg 


Leu 


lie 


Thr 


Ala 


Ala 


Gin 










170 










175 










180 


Glu 


Axy 


Asn 


Val 


Leu 


Ala 


Glu 


Cys 


Arg 


Leu 


Asp 


Pro 


Ala 


Aso 


Leu 










185 










190. 










195 


Gin 


T .01 1 


L16U 


Lie u 


Asp 


Trn 

a x Yr 


va x 


Gly Ala 


Pro 


Ala 


Phe 


Ala 


Ala 


G1V 










200 










205 










210 


Glu 


Axa 


■ 

Trp 




Pro 


Val 


Cys 


Leu 


Pro 


Arg 




Asn 


Pro 


Asp 


Glv 










215 










220 










225 


pVip 


rTie 


Tyr 






Val 

vox 


nxa 


Arg 


Leu 


Asp 


Ala 


Met 


Pro 


Val 


\-ys 










230 










235 










240 


T .01 1 


Leu 


Leu 


Leu 


Glv 


Thr 


Vj -111 


Arg 


Glu 


Ala 


rl It; 


His 


Ala 


M<=>t~ 


Ala 










245 










250 










255 


Al a 


Cys 


Arg 


Arg 


Leu 


Val 


VjxU 


Asp 


Gly Met 


nx 0 


Ala 


Leu 


Glv 
vaxy 


Ala 










260 










265 










270 


Mot* 


Arg 




Leu 


Glv 


Glu 


Ala 


Ala 


Ser 


Phe 




Asn 


Ala 


Cat* 

OCX 












275 










280 










285 

£t O J 


AT a 
AXci 


Ser 


a j. a 


Pro 


Ala 


A Jr 1 


Ser 


Val 


Gin 


Ala 


Val 


Gly Ala 


Prn 

XT J. w 


Glv 
oxy 










A <7 W. 










295 










TOO 

J V w 


Leu 


Arg 


His 


pne 


T.011 
ucu 




i^ys 


Pro 


Leu 


Asp 


He 


Pro 


Asp 


Hi Q 


nxo 










305 










310 










■Jic: 

J X 3 


Arg 


Gin 


Leu 


Pro 


Gin 


Phe 


Thr 


Ser 


Pro 


Glu 


Leu 


Glu 


Ala 


XT X \J 












320 










325 










•J -J v 




Arg 


Glu 


Glu 


Glu 


Arg 


Gin 


Arg 


Leu 


Ser 


Asp 


Leu 


Tyr 


Hi <; 
nx a 












335 










340 












Leu 


His 


Ala 


Arg 


Leu 


His 


Ser 


Thr 


Ser 


Arg 


Pro 


Leu 


Arg 


Leu 


lie 










350 










355 








360 


Tyr 


His 


Val 


Ala 


Glu Lys 


Glu 


Thr 


Leu 


Leu. Ala 


Trp 


Val 


Thr 

X I IX 












365 










370 










375 


Lys 


Phe 


Glu 


Leu 


Tyr 


Thr 


Cys 


Leu 


Ser 


Pro 


Leu 


Val 


Thr 


Lys 


Ala 










380 










3 85 










390 


Gly Ala 


lie 


Leu 


Val 


Val 


Thr 


Lys 


Leu 


Leu 


Arg 


Trp 


Val 


Lys 


Lys 










395 










400 










405 


Glu 


Glu 


Asp 


Arg 


Leu 


Phe 


lie 


Arg 


Tyr 


Pro 


Pro 


Lys 


Tyr 


Ser 


Thr 










410 










415 










420 


Pro 


Pro 


Ala 


Thr 


Ser 


Thr 


Asp 


Gin 


Ala 


Ala 


His 


Asn Gly 


Leu 


Phe 










425 










430 










435 



Thr Gly Leu 

<210> 21 
<211> 357 
<212> PRT 

<213> Homo sapiens 

<220> c : 

<221> misc_feature 

<223> Incyte ID No: 2681738CD1 



<400> 21 



Met 


Ala 


Thr 


Thr 


Val 


Pro 


Asp Gly Cys 


Arg 


Asn 


Gly 


Leu 


Lys 


Ser 


1 








5 






10 










15 


Lys 


Tyr 


Tyr 


Arg 


Leu 


Cys 


Asp Lys Ala 


Glu 


Ala 


Trp 


Gly 


lie 


Val 










20 






25 










30 


Leu 


Glu 


Thr 


Val 


Ala 


Thr 


Ala Gly Val 


Val 


Thr 


Ser 


Val 


Ala 


Phe 










35 






40 










45 


Met 


Leu 


Thr 


Leu 


Pro 


lie 


Leu Val Cys 


Lys 


Val 


Gin 


Asp 


Ser 


Asn 
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Arg Arg Lys Met Leu Pro Thr Gin Phe Leu Phe Leu Leu Gly Val 

65 70 75 v 

Leu Gly He Phe Gly Leu Thr Phe Ala Phe lie He Gly Leu Asp 
80 85 1° 

Gly Ser Thr Gly Pro Thr Arg Phe Phe Leu Phe Gly He Leu Phe 
95 100 \ 105 

Ser He Cys Phe Ser Cys Leu Leu Ala His Ala Val Ser Leu Thr ; 
v ■_. no _ ; „ ^^JA 5 

Lys Leu Val Arg Gly Arg Lys Pro Leu Ser £eu Leu Val He' Leu ~: 

125 130 135 

Gly Leu Ala Val Gly Phe Ser Leu Val Gin Asp Val He Ala lie. 

.... 140 145 v 150 

Glu Tyr He Val Leu Thr Met Ash Arg Thr Asn Val Asn Val Phe 

155 160 165 

Ser Glu Leu- Ser Ala Pro Arg Arg- Asn Glu Asp Phe Val Leu Leu 

170 175 180 

Leu Thr Tyr Val Leu Phe Leu Met Ala Leu Thr Phe Leu Met Ser 

185 19° 195 

Ser Phe Thr Phe Cys Gly Ser Phe Thr Gly Trp Lys Arg His Gly 

200 205 210 

Ala His lie Tyr Leu Thr Met Leu Leu Ser He Ala He Trp Val 

215 220 225 

Ala Trp He Thr Leu Leu Met Leu Pro Asp Phe Asp Arg Arg Trp 

230 235 - .. 240 

Asp Asp Thr He Leu Ser Ser Ala Leu Ala Ala Asn Gly Trp Val 

245 250 255 

Phe Leu Leu Ala Tyr Val Ser Pro Glu Phe Trp Leu Leu Thr Lys 

260 265 270 

Gin Arg Asn Pro Met: Asp; Tyr Pro Val Glu, Asp Ala Phe Cys Lys 

275 280 285- 

Pro Gin Leu Val Lys Lys Ser Tyr Gly Val Glu Asn Arg Ala Tyr; 

290 295 : 300 

Ser Gin Glu Glu lie Thr Gin Gly Phe Glu Glu Thr Gly Asp Thr 

305, 310 . 315 : 

Leu Tyr Ala Pro Tyr Ser Thr His Phe Gin Leu Gin Asn Gin Pro 

320 325 330 

Pro Gin Lys Glu Phe Ser lie Pro Arg Ala His Ala Trp Pro Ser 
. 335 .'. 340 345 

Pro Tyr Lys Asp Tyr Glu Val Lys Lys Glu Gly Ser 
V 350: r ,355 

<210> 22 

<211> 1069 : 
<212> PRT 

<213> Homo sapiens 

<220>- , • „. 

<221> misc__feature 

<223> Incyte ID No: 2859482CD1 

Met°Asp 2 Asp Lys Ala Ser Val Gly Lys lie Ser Val Ser Ser Asp 
1 5 10 15 

Ser Val Ser Thr Leu Asn Ser Glu Asp Phe Val Leu Val Ser Arg 

20 v 25 30 

Gin Gly Asp Glu Thr Pro Ser Thr Asn Asn Gly Ser Asp Asp Glu 

35 r 40 . 45 

Lys Thr Gly Leu Lys lie Val Gly Asn Gly Ser Glu Gin Gin Leu 

50 55 . ' ,60 

Gin Lys Glu Leu Ala Asp Val Leu Met Asp Pro Pro Met Asp Asp 

65 . - • 70 ....... 75 

Gin Pro Gly Glu Lys Glu Leu Val Lys Arg Ser Gin Leu Asp Gly 

80 ., . 85 ... ;,. 90 

Glu Gly Asp Gly Pro Leu Ser Asn Gin Levi Ser Ala Ser Ser Thr 

" ~~ " ~ ^ " ~~ ™ --^ - - - - - 20/36 ^ ~ ^ ~ ~~ ~ : " 
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95 






100 










105 


lie 


i Asn 


i Pro 


Val Pro Leu 


l Val Gly 


Leu 


. Gin 


Lys 


Pro 


Glu 


Met 


Ser 








110 






115 










120 


Leu 


Pro 


> Val 


Lys Pro Gly 


' Gin Gly Asp 


' Ser 


Glu 


Ala 


Ser 


Ser 


Pro 








125 






130 










135 


Phe 


Thr 


Pro 


Val Ala Asp 


> Glu Asp 


Ser 


Val 


Val 


Phe 


Ser 


Lys 


Leu 








140 






145 










150 


Thr 


Tyr 


Leu 


Gly Cys Ala 


Ser Val 


Asn 


Ala 


Pro 


Arg 


Ser 


Glu 


Val 








155 






160 










165 


Glu 


Ala 


Leu 


Arg Met Met 


Ser lie 


Leu 


Arg 


Ser 


Gin 


Cys 


Gin 


He 








170 






175. 








180 


Ser 


Leu 


Asp 


Val Thr Leu 


Ser Val 


Pro 


Asn 


Val 


Ser 


Glu 


Gly lie 








185 






190 










195 


Val 


Arg 


Leu 


Leu Asp Pro 


Gin Thr 


Asn 


Thr 


Glu 


He 


Ala 


Asn 


Tyr 








200 






205 










210 


Pro 


He 


Tyr 


Lys He Leu 


Phe Cys 


Val 


Arg 


Gly 


His 


Asp Gly Thr 








215 






220 










225 


Pro 


Glu 


Ser 


Asp Cys Phe 


Ala Phe 


Thr 


Glu 


Ser 


His 




A en 
noil 


Al a 








230 






235 












Glu 


Leu 


Phe 


Arg He His 


Val Phe 


Arg 


Cvs 


Glu 


He 


Gl n 


Gl ti 


Al A 








245 






250 










255 


Val 


Ser 


Arg 


He Leu Tyr 


Ser Phe 


Ala 


Thr 


Ala 


Phe 


Arg 


A Trf 


Oca 








260 






265 












Ala 


Lys 


Gin 


Thr Pro Leu 


Ser Ala 


Thr 


Ala 


Ala 


Pro 


Gin 


X 1 i_t 


Dm 








275 






280 










2 85 


Asp 


Ser 


Asp 


lie Phe Thr 


Phe Ser 


Val 


Ser 


Leu 


Glu 


Tip 


T A/C 


Gl ii 








290 






295 










300 


Asp 


Asp 


Gly 


Lys Gly Tyr 


Phe Ser 


Ala 


Val 


Pro 


Lys 


Asp 


Lys 


Asp 








305 






310 










315 


Arg 


Gin 


Cys 


Phe Lys Leu 


Arg Gin Gly 


He 


Asp 


Lys 


Lys 


He 


Val 








320 






325 










330 


He 


Tyr 


Val 


Gin Gin Thr 


Thr Asn 


Lys 


Glu 


Leu 


Ala 


lie 


Glu 


Ara 








335 






340 










345 


Cys 


Phe 


Gly 


Leu Leu Leu 


Ser Pro 


Gly 


Lys 


Asp 


Val 


Arg 


Asn 


Ser 








350 






355 










360 


Asp 


Met 


His 


Leu Leu Asp 


Leu Glu 


Ser 


Met 


Gly Lys 


Ser 


Ser 


Asp 








365 






370 










375 


Gly 


Lys 


Ser 


Tyr Val He 


Thr Gly 


Ser 


Trp 


Asn 


Pro 


Lys 


Ser 


Pro 








380 






385 










390 


' His 


Phe 


Glh 


Val Val Asn 


Glu Glu 


Thr 


Pro 


Lys 


Asp 


Lys 


Val 


Leu 








395 






400 










405 


Phe 


Met 


Thr 


Thr Ala Val 


Asp Leu 


val 


He 


Thr 


Glu 


Val 


Gin 


Glu 








410 






415 










420 


Pro 


Val 


Arg 


Phe Leu Leu 


Glu Thr 


Lys 


Val 


Arg 


Val 


Cys 


Ser 


Pro. 








425 






430 










435 


Asn 


Glu 


Arg 


Leu Phe Trp 


Pro Phe 


Ser 


Lys 


Arg 


Ser 


Thr 


Thr 


Glu 








440 






445 










450 


Asn 


Phe 


Phe 


Leu Lys Leu. 


Lys Gin 


He 


Lys 


Gin 


Arg 


Glu 


Arg 


Lys 








455 






460 










465 


Asn 


Asn 


Thr 


Asp Thr Leu 


Tyr Glu 


Val 


Val 


Cys 


Leu 


Glu 


Ser 


Glu 








470 






475 










480 


Ser 


Glu 


Arg 


Glu Arg Arg 


Lys Thr 


Thr 


Ala 


Ser 


Pro 


Ser 


Val 


Arg 








485 






490, 










495 * 


Leu 


Pro 


Gin 


Ser Gly Ser 


Gin Ser 


Ser 


Val 


He 


Pro 


Ser 


Pro 


Pro 








500 






505 : 










510 


Glu 


Asp 


Asp 


Glu Glu Glu 


Asp Asn 


Asp 


Glu 


Pro 


Leu 


Leu 


Ser Gly 








515 






520 










525 


Ser Gly Asp Val Ser Lys 


Glu Cys 


Ala 


Glu 


Lys 


lie 


Leu 


Glu 


Thr 








530 






535 










540 


Trp 


Gly 


Glu 


Leu Leu Ser 


Lys Trp 


His 


Leu 


Asn 


Leu 


Asn 


Val 


Arg 








545 






550 










555 


Pro 


Lys 


Gin 


Leu Ser Ser 


Leu Val 


Arg 


Asn 


Gly 


Val 


Pro 


Glu 


Ala 








560 






565. 








570 


Leu 


Arg 


Gly Glu Val Trp 


Gin Leu 


Leu 


Ala '■ 


Gly Cys 


His 


Asn 


Asn 
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: : ■ \ 575 : : : \ - . 580 585 

Asp His Leu Val Glu Lys Tyr Arg lie Leu He Thr Lys Glu Ser 
590 595 600 

Pro Gin Asp Ser Ala He Thr Arg Asp lie Asn Arg Thr Phe Pro 
605 . 610 615 

Ala His Asp Tyr Phe Lys Asp Thr Gly Gly Asp Gly Gin Asp Ser 
620 ... 625 / 630 

Leu Tyr Lys lie Cys Lys Ala Tyr Ser Val Tyr Asp Glu Glu lie 
635 _ 640 645 

Gly Tyr Cys Gin Gly Gin Ser Phe Leu Ala Ala Val" Leu Leu Leu 
650 655 660 

Lys He Met 
675 

l Phe Glu Asp 
690 

; Gin Glu Tyr 
705 

■ Leu Glu Ala 
720 

i Thr Ala Lys 
. ,735 
i Leu Leu Cys 
750 

r Leu Leu Lys 
765 

i Gly Ala Leu 
780 

j Ser Glu Glu 
/ 795 
: Lys lie Ser . 

810; 
3 Thr Met Arg 
825 

j Phe Glu Arg 

840 - 
x Glu Glh Glu 
855 

s lie Ala Leu 
870 

d Ala Leu Asn 
885 

p Ala Glu Glu 
900 

u Lys Glu Met 
915 

e Lys Lys Asn 
930 

r Gin Leu Ser 
945 

1 Glu lie Glu 
: 960 
s Arg Glu Phe 
975 

r Thr Lys Glu 
990 

r Leu Lys Asn 
1005 

r Lys Leu Gin 
1020 

u His His Leu 
,1035 

s Thr Trp Phe 
, 1050 



Glu 


Glu 


Gin 


Ala 


Phe 


Ser ■ 


Val 


Leu 1 




665 










670 




Gly 


Leu 


Arg 


Glu 


Leu 


Phe 


Lys 


Gin, 


680 










685 




Lys 


Phe 


Tyr 


Gin 


Leu Glu Arg 


Leu 


695 










700 




Leu 


Tyr 


Asn 


His 


Phe 


Leu 


Asp 


He 




710 










715 




Ala 


Ser Gin 


Trp 


Phe 


Leu 


Thr 


Leu 




J 25 










730 




Tyr 


Met 


Val 


Phe 


His 


He 


lie Asp 


740 










745 


Leu 


Ser 


Val 


He 


Phe 


Asn 


Val 


Ala 




755 










760 




Asp 


Asp 


Leu 


Leu 


Leu 


Thr 


Asp 


Phe 


770 










775 




Arg 


Val 


Gin 


Leu 


Pro 


Lys 


Arg 


Tyr 


785 










790 




Lys 


Leu 


Met 


Glu 


Leu Ala 


Cys 


Asn 


800 










805. 




Leu 


Lys 


Lys 


Tyr 


Glu 


Lys 


Glu 


Tyr 




815 










820 




Ala 


Gin 


Gin 


Glu 


Asp 


Pro 


lie 


Glu 




830 










835 




Arg Leu 


Gin 


Glu 


Ala 


Asn 


Met 


Arg 




845 










850 




Leu 


Ala 


His 


Glu 


Leu 


Val 


Thr 


Ser * 




860 










865 




Leu 


Asp 


Asn 


Ala 


Glu 


Glu 


Lys 


Ala 




875. 










880 




Leu 


Met 


Thr 


Lys 


Gin Lys 


Leu 


He 




890 










895 




Arg 


Leu 


Glu 


Glu 


Glu 


Ser 


Ala 


Gin 


905 










910 




Glu 


Leu .j Asp 


Lys 


Ala 


Glu 


Ser 


Glu 




920 










925 




He 


Gly Asp. Tyr 


Lys 


Gin 


He 


Cys 




935 










940 




Glu 


Lys 


Gin 


Gin 


Thr 


Ala 


Asn 


Lys 




950 










955 




Gin 


Lys 


Val 


Asp 


Asp 


Cys 


Glu 


Arg 




965 










970 




Glu Gly Arg 


Val 


Lys 


Gly 


lie 


Ser 




980 










985 




Glu 


Asp 


Thr 


Asp 


Glu 


Glu 


Lys 


Glu 




995 










1000 




Glu 


Met 


Glu 


Leu 


Glu 


Leu Ala 


Gin 




1010 










1015 




Ala 


Glu 


Cys 


Lys 


He 


,Gln 


Asp. Leu 




1025 










1030 




; Leu Asn 


Glu 


Val 


Gin 


Ala 


Ala 


Lys 




1040 










1045 




* Leu 


Ser 


Ser 


He 


Lys 


Thr 


Ala 


Thr 
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Lys Glu Thr Cys 



1055 



1060 



1065 



<210> 23 
<211> 1995 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 209171CB1 



<400> 23 

ggaccgtctt 

tggcagctgg 

gaggaaagag 

agctcaggaa 

tttgacagat 

actgggaggg 

tgtctaccta 

ttctggggta 

ctctgccagt 

aaaaaaggtt 

ggccaaggag 

gacaaagagg 

caggaaagct 

agtcatggtc 

cctgcttcac 

agttctgcag 

gaccttgaaa 

atggcagata 

ttctcatcca 

attgaacgac 

agacaaaaag 

attgacaaga 

tgtcatgtgg 

agtggaagag 

gaaaaaggcc 

tcagatgcag 

gaagaggcag 

tctcccgtca 

aagcaagtta 

agaaaattct 

catgttcatt 

tttcaccacc 

aactgtgtat 

tataaaaaaa 



cccagtccca 

tacctccatc 

aggagccttg 

gctgggacca 

gtactggcca 

gttctgcttt 

ccctagcaga 

ttccattctg 

tggaggttta 

ttggggaacc 

agcaggctag 

agaggcagga 

caattaaaag 

agtcttccca 

aaagtgttgc 

aagaaatcac 

gagaagtctc 

aggaggttgg 

gtaaccaggt 

atgccatgta 

aggccaagac 

atgagaagtg 

actcctgtct 

catgttcaac 

acagtgaagg 

acatcaagtc. 

gctgcagcag 

agtcttttgt 

ctgtccagcc 

gaatttctag 

aagccatagt 

agcacctacc 

tctgttttat 

aaaaa 



tcaagagaaa 
actgtttaaa 
ggaccacact 
gtcaagccag 
ctcggctgag 
tctcaaagct 
tgccaaaggt 
ccctgatgga 
tcaaaagagc 
agtgttacct 
tgagaaaaat 
gtctagggca 
cttgaaagag 
agggattgtt 
tgctttgacc 
tgtttgtcct 
tccaggtagc 
taacaaggaa 
atcctgcccg 
ctgcaatggt 
caagagtgac 
ttacctctgt 
ccagcttgca 
tgtggagggg 
ccgactcctt 
ttcagaaaca 
agagatgcag 
ttccatttca 
aggtagtcgg 
ggtccaaaag 
ggtccctagt 
cagcattctg 
aacagtttgt 



accacagact 
ggctcacata 
gaaaaaactg 
ccagtgtttg 
cacacacagt 
gtccagggta 
ctceaggaca 
gtagacccta 
ctgaaaatgg 
agacctcctt 
gaatgcatct 
tctgactggc 
aaacttttgt 
gaagaaactt 
agtaagagaa 
gagacccagc 
agagatatct 
gatgctgaga 
ctatgtgacc 
ctgatggagg 
agtgggacag 
aaatccctgg 
aaggctgacc 
aagtggcagc 
agtttcttgg 
ggagccttca 
agttctttca 
gaagccacag 
acacggacca 
ttgacaaaac 
tcattgttga 
gtttttatgt 
ttgaatttac 



ctgggctcac 
tcagtcaggg 
aagaggagcc 
agaatgtgaa 
gtgggaagcc 
gcggggacac 
ctgggggcac 
accagtatac 
ctcagaggca 
ctctgatcca 
cagaagatat 
actcaaaaac 
tggaggaaga 
ctgaagaggg 
gcttagtcct 
taagttcctc 
tggatggagt 
aggaagtagc 
aatgctttcc 
aagatacagt 
ctgcccagac 
tcccatttag 
aaggagatgg 
agaggctgaa 
aacagtctga 
gggtgccttc 
cacgtcgtga 
attgcttagt 
aagctggcag 
cattagtagg 
gcaagtttta 
tttttatgat' 
ttacagttaa 



tgaaggcata 
aaacgaggct 
ggtctctggc 
cgttaaatct 
acaggaaagt 
atctaggcac 
tgtgaactat 
caaggtcatt 
gctccttaat 
gaatgaatgt 
gggagatgaa 
caaggatttc 
accaacaacc 
aaactctgta 
tatgccagag 
tgaaactttt 
cagaataata 
tatttctacc 
acccacaaag 
attgactcgg 
ttctctagac 
agagtatcag 
acctgaaggg 
gaacccaaag 
gcacaagact 
accagggatg 
cttaaatgaa 
ggactttaaa 
aggaagaagg 
aggggtgggc 
gccctgcagt 
ctatgcagac 
aaaatttaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 
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1440 

1500 

1560 

1620 

1680, 

1740 

1800 

1860 

1920 

1980 

1995 



<210> 24 
<211> 2051 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 945430CB1 



<400> 24 

tcgaccacgc 

gatttggaaa 

tatcaggagt 

agttttcaaa 

gggtctgcca 

tgctcgagtc 



gtccgtgcgt 
tcactggatc 
cattctgcca 
aggtacatgg 
tgtgttgcac 
tggggacatt 



gttcactgtt 
tgctcaatac 
ctgcagtgga 
gaaatactcc 
cgctgctgac 
ttcctaagtg 



tagtagtact 
aaaaatgttt 
tttccttcct 
tcagaaaaaa 
cacagtggaa 
gctcaatggc 



caaaactgcc 
tttcgagtct 
gtgatggtgc 
gccgtctttg 
gaggctccac 
tctctacttc 



agtgtgagag 60 - 
ttctccattt 120 
accggctccc 180 
ggcagtgtcg 240 
ggggcatctc 300. 
gaattggacc 360 
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BNSDOCID: <WO 0107612A2_L> 
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PdYUSOO/20035 



tgggaaattc 
. tcaccagttc 
tacatataag 
tctcccggat 
taaagctgca 
ctacctctgc 
agaaaaggta 
tgacccggat 
taaggttatt 
ggtgatatgt 
aatgacaagg 
tgccacttct 
taatacgcgg 
cagcaaacct 
aattgatttg 
caggaaggct 
aaggtttgtt 
attgtcctat 
tgaaaatcta 
. ctatgatcga 
agtgggggat 
agatggcttt 
tggtggggtt 
agttttggat 
ttatgggttc 
aggtttaaaa 
aaaagggggg 
tccctatttg 
tgtaacaagg 



gagtttggga. 
agaatggcaa 
gccaacagtg 
ccatgcaaga 
gccatgactg 
actgagacca 
gattggagca 
ggaacagcat 
-cg.gg.ttcctc. 
tctattgctt 
aactatataa 
aaaattcggg 
tttcatgtgg 
tttgttacat 
tgctctcaag 
ggggaagggc 
ttgcctttaa 
acttcagcca 
catcaggagg 
ttcagtggca 
tctctgatca 
tatccctcag 
attctttctg 
gccaagaact 
catggtacct 
taagtgtgca 
caaggaggaa 
ggtgatgggt 

g 



aggataagta . 

agggcacagt 

ctaaaaaccg 

atgtttttga 

acaatactaa 

actttatgaa 

aatttattgc 

acaatatggg 

cagagaaggt 

ctacagagaa 

ttttcattga 

gaaaggcctt 

tggaaaaacg 

ttcatcaaat 

ataatggaag 

ttgatcaggt 

atgtcagttt 

gtgctgtgaa 

acrctagaaaa 

aaaagtatca 

aggttgatgt 

aacctgtttt 

tggtgatcac 

ttgaagagct 

tcatacccat 

cttggacata 

gaggggcagg 

tcattagaag 



caatcattgg 

gacatacagg 

aar.tgtgatc 

acgtttcatg 

tgtcaactat 

taaagtggac 

tgtgaatgga 

gaactccttt 

ggaccttggg 

agggaaacct 

acaacctcta 

ttcagatggg 

cactggacag 

caatgccttt 

aaccctagaa 

ccataattca 

gaatgcccct 

acaggctgat 

ggaaggaggc 

tttcttttat 

ggtgaataag 

tgttccagca 

tcccaaccag 

gggccgagca 

ctgatgggac 

aagactggag 

ggttaaaaag 

tccaaacctc 



tttgatggga 

agcaagtttc 

tcagaatttg 

tccaggtttg 

gtgcggtaca 

attgaaactc 

gcaactgcac 

gggccatatg 

gagacaatcc 

tcttactacc' 

aagatgaacc 

ataagctggg 

ctccttccag 

gaggaccagg 

gtttaccagt 

gcagccaaat 

gagggagaca 

ggaacgatct 

attgaatttc 

ggctgtggct 

acactgaagg 

ccaggaacca 

aatgaaagca 

gaggtacctg 

aaccacaagg 

aaataaacac 

ctacctattg 

agcagcacac 



tggcgctgct 

tacagagtga 

gcacactggc 

agctgcctgg 

agggtgatta 

tggaaaaaac 

atcctcatta 

gtttctccta 

atggagtcca 

atagctttgg 

tgtggaaaat 

aaccccagtg 

ggagatacta 

gctgtgttat 

tacagaatct 

ctttccctcg 

acctgagtcc 

ggtgctctca 

ctcagatcta 

ttcggcattt 

tttggagaga 

atgaagaaga 

attttctcct 

tgcagatgcc 

tctggaaact 

tgaggactcc 

aatactatgt 

aatatactca 
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480 
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780 

840 

900 

960- 

1020 

1080 

1140 
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1320 
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2040 
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<210> 25 

<211> 2067 

<212> DNA ^ 

<213> Homo sapiens 

<220> 

<221> misc — feature 

<223> Incyte ID No: 1305513CB1 



<400> 25 

ggcaaaaagc 

ctgctagttt 

aaacaaaact 

gcacccacgt 

ggcttactga 

atcaccggga 

ctgatggcaa 

ttgtggatta 

atcatgagac 

catgcagctg 

tccttgggtt 

gtgaaagata 

gatatcctca 

acctggactt 

ttccacactg 

caccgcttct 

tggactctgt 

attagggaga 

ggatttatgc 

atatttttcg 

cagaactgga 

ggatttggaa 

acaggcagaa 

atgttctata 



atgcagaaaa 
tgggtaggag 
cttcggaaat 
gtcgggcttt 
gagctccatt 
acggttcctt 
aacaagtgga 
tattttcagt 
ttgactgagg 
acaacatttt 
tcataatagg 
agaccttgca 
gatctgcaat 
atgggactct 
ccttcatgct 
atacaaagag 
ctgtggccat 
aagatcaatg 
tgcttcttgc 
tccacgatcg 
cttttcatgg 
ggggtcccac 
gaaggctatt 
taatgacttt 



agaagcagac 
aactgggaag 
ggtaaaataa 
gtggtgtctg 
tctggaaagc 
tgcagcacag 
agaaaagagg 
aaaatgtatg 
ctgtatcctt 
gcaaaatctc 
agtcagcgtg 
tagagcacct 
ttgtt^tccca 
gacttgcaaa 
cttctgcatc 
gctgaccttt 
ggcatttccc 
caccttccaa 
tctcatcctc 
aagaaaaatg 
tcctggagcc 
accacccacc 
ggtcttagac 
tctgtttcta 



gttttacatt 
ttgttgctta 
gaaaatgcat 
tggtatcatc 
cttacaagac 
aagcaatctc 
aagcatgact 
gatctatctt 
atcctccatc 
tcgcctctaa 
gtgggcaacc 
tactacttcc 
tttgtgttca 
gtgattgcct 
agtgtcacca 
tggacgtgtc 
ccggttttag 
caccgctcct 
ctagccacac 
aagccagtcc 
agtggccagg 
. ttgctgggca 
gagttcaaaa 
accttgtggg 



gggaattaat 

aaattttata 

gattctagag 

cgaccgtttg 

tgaggaatat 

tctccccatc 

gcagatcaga 

ttccttgttc 

catctatggc 

cagcctttct 

tcctgatctc 

tgttggatct 

actctgtcaa 

ttctgggggt 

gatacttagc 

tggctgtgat 

acgtgggcac 

tcagggctaa 

agcttgtcta 

agtttgtagc 

cagctgccaa 

tcaggcaaaa 

tggagaaaag 

gcccctacct 



gaaagcgtgt 

tcacctccac 

gcattcctaa 

gactggttag 

cagactgcga 

ttcgcatatt 

tcagttctct 

ttatatctag 

gaactatagc 

gaaactgact 

cattttgcta 

ttgctgttca 

aaatggctct 

tttgtcctgt 

tatcgcccat 

ctgtatggtg 

ttactcattc 

tgattcctta 

cctcaagctg 

agcagtcagc 

ttggctagca 

tgcaaacacc 

aatcagcaga 

ggtggcctgt 



60 

120' 

180 

240 

300 

360 

420 

480 

540 
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tattggagag 
tggatgagtt 
ctgaggcgct 
ccttactgtg 
tctgctgagc 
cagttttaag 
tttaaatctc 
aaccactgcc 
aagtatcagt 
agcattggac 
tttagaagtt 



tttttgcaag 
ttgcccaagc 
gtttcagcac 
ctatatgagg 
aattgtggcc 
gatttgggca 
agagtgatcc 
ctaagtttct 
gctaaatgct 
atcttaataa 
tgaagacttt 



agggcctgta 

aggaatcaat . 

aacccttcl t 

gagcatctgt 

catagccata 

acattctgca. 

tgctgactgc 

ttatgtggtc 

ctgtatgtca 

attaagttga 

aaaacag 



gtaccagggg 
ccttttgtct 
tactgcagaa 
aaatctttag 
ttttgagaag 
gtctttgcaa 
cagcaaaggt 
aaaaactaga 
ctacatatga 
catgaggtaa 



gatttctaac 
gcattttctc 
aatccaggtt 
ccttgtgaaa 
aaattcaaga 
tagttcacct 
ttgtaattaa 
taatgaaagt 
aaaaacatca 
atgtgttgat 



agctgctgtc 
aaacagggag 
accaagggaa 
actaaccttc 
atggaatcag 
ataatcctat 
gaagggactg 
agcaggtgct 
aaaaacaatt 
aaaaactaat 



<210> 26 
<211> 1165 
<212> DNA 

<213> Homo sapiens 

<22o> ; 

<221> misc_feature 
<223> mcyte ID No: 



1876283CB1 



<400> 26 

cttcttccgg 

ggtcctggtg 

gcacgatacc 

cctgctcctg 

ctacgtgaat. 

tcctccagcc 

tcggcactgc 

ggagaactgt 

ggtggtgctt 

gggtctgtgg 

ctcgttggtg 

cacctgggaa 

cttcgaccga 

ctgggagacc 

ctggaggccg 

agcgctcaga 

cttcaagtca 

agaaaaggac 

ccaggctgct 

agacaaacca 



gtggggcccc 

cggaccgggc 

gagctgcggc 

gtgctgggct 

gtgcagcccc 

atccctcttc 

cgtgagtgcc 

gtgggagagc 

ctgtggggcc 

ttgcggtcca 

gccagcctgc 

ttcatctcct 

ggcctgaccc 

ctctgggctg 

ggctaccgtc 

gggcctgggg 

gacagatccc 

ctgtgggtgg 

gccagtgcac, 

taaaaataaa 



gggccgaggc 
acaccgtgct 
aatgggagga 
ccctgctgct 
agcctcagga 
ggcgctgcag 
gccgttgcgt 
gcaaccaccc 
tgtacctggc 
gcgggctcct 
tcctcgtctc 
cacaccgcat 
gcaacctggc 
aggaggagga 
ttgtgcctga 
ccctcactcc 
tgccttggtg 
ctcaggccca 
acttttacaa 
aaaaa 



gatggcgccc 
gacctgggga 
gcagggggag 
ctacctcgct 
ggagrctcaaa 
atactgcctg 
ccgccgctac 
actctttgtg 
atggtcaggc 
gttcgccacc 
gcacctctac 
cgcctatctc 
ccacttcttc 
agagggcagc 
aaaccacggg 
tgcccacgcc 
ggcagttctg 
agcagacccc 
atttaatata 



tgggcgctcc 
atcacgctgg 
ctgctcctgc 
gtgtcactca 
gaggagcaga 
gtgctgcagc 
gaccaccact 
gtctacctgg 
ctccggttct 
ttcctgctgc 
ctggtggcca 
cgccagcgcc 
tgtggatggc 
agcccagctg 
gcctgtcccc 
tcccagaccc 
ccttccaagg 
gggctccacc 
aagcaagtcc 



tcagccctgg 
tgctcttcct 
ccctcacctt 
tggaccctgg 
cagccatggt 
ccctgagggc 
gcccctggat. 
cgctgcagct 
tccagccctg 
tgtccctctt 
gcaacaccac 
ccagcaaccc 
cctcagggtc 
tttagggttg 
agctggggtg 
cagaacggag 
aagaagggga 
ccagccccgc 
agtcttaaaa 



<210> 27 
<211> 3523 . 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2470285CB1 

<220> 

<221> unsure 

<222> 3393, 3419, 3435, 3443 
<223> a, t, c, g, or other 



1500 
15 60 
1620 
1680 
1740 
1800 
1860 
,1920 
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2040 
2067 



60 / 
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240 

300; 

360 
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1020 

1080 

1140 

1165 



<400>-27 

gcggccgcag 

ggctcggccg 

ggcgaccgtc 

gccttcaccc 

tttaagaact 

cacccaaagg 



caatgccggg 
ggcccagcgg 
gggtggactg 
aagcgctgga 
ttccttttct 
ccttgtctgg 



cccgctaggg 
cgcggcgccg 
ctccgggaag 
tatcagtatg 
agaagagcta 
gttgaaagaa 



ctgctctgct 
cctctctgcg 
gggctgacgg 
aa'caacatta 
caattggcgg 
ctcaaagttc 



tcctcgccct 
cggcgccctg 
ccgtgcccga 
ctcagttgcc 
gcaacgacct 
taacgctcca 



ggggctgctc 60 
cagctgcgac 120 
ggggctcagc 180 
agaagatgca 240 
ttcttttatc '300 
gaataatcag 360 
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ttgaaaacag' 
gatgccaacc 
catctgtggc 
cccaccctac 
tttaccaacc 
agtcaacact 
ttgggggaat 
agtaattcta 
. atacatttgt 
gatcttcatt 
ggaactgtcc' 
aatttgtgcc 
gaccttccaa 
atctaccaaa 
agtagaaacc 
aacctagatg 
aatcaactga 
gttaacctca 
gactcttatg 
gagaaaggta 
caaataatta 
ggaagctgga 
ctgcttgtta 
ataggcttga 
cttgatgctg 
ggctgcaaag 
atgctagcaa 
aatcatctca 
ggctgttttc 
cctacaggtg 
gcatttttat 
ctctcagaaa 
tgcatctttt 
atcagccccg 
aatccagtcc 
cgacgtgtta 
ctggaacagg 
gtttgcgact 
aaatcacaca 
tccgactgtg 
tcagacagtt 
ttccctttgg 
gtaaccgttt 
tcatctggga 
agtttatttc 
aaatgcaatg 
cttaggtgta 
acctatttgt 
tttaaaatgt 
tccttaatct 
tgagagtata 
actataaatt 
tacagctgtt 



tacccagtga 
atattacctc 
tggatgacaa 
aggcgctgac 
tttcaagcct 
gttttgatgg" 
ttcctcaggc 
tttctgttat 
atgataatcc 
ccctagtcat 
acctggaaag 
aagaacaaaa 
gttttaatgg 
taaaggaagg 
tgatacatga 
taagtttcaa 
aacttgtggg 
ggitctttatc 
caaatttaaa 
ctgctgatgc 
tccattgtac 
tgattcgtct 
ttttaacaac 
tttctgtgtc 
tgtcctgggg 
tagctgggtt 
ctgtcgaaag 
aacagttccg 
cccttttcca 
aaacgccatc 
taatggccgt* 
actcacaatc 
tctgccctgt 
aaataatgaa 
tgtatgtttt 
ccaagaaaag 
atttctacta 
gctgcgaatc 
gctgtcctgc 
gcacacagtc 
ctgaccaggt 
tgcgctatgc 
cccccgtcaa 
agcacttctg 
tcaaaccagt 
tccaagcaat 
gtagagcaat 
gtattaagca 
gattttctat 
caggacaact 
cataggcatt 
ttgtntaagc 
aggttaaaaa 



agccattcga 
agtccccgag 
cagcttgacg 
cctggctctc 
ggtagttctg- 
actagataac 
tattaaagcc 
ccctgatgga 
tctgtctttt 
tcghggtgca 
tctgactttg 
gatgcttagg 
ttgccatgct 
cacctttcaa 
aattcacagt 
tgaattaact 
caacttcaag 
agtaccatat 
cacagaagat 
agcaaatgtc 
accttcaaca 
tactgtgtgg 
atttgcatct 
taacttattc 
cagattcgct 
tcttgcagtt 
aagcttatct 
ggttgctgcc 
tagaggggaa 
attaggattc 
tatctacact 
tagcatgatt 
ggcgtttttt 
gtctgttact 
cttcaaccca 
tggatcagtt 
cgactgtggc 
gtttctttta 
attggcagtg 
ggcccactct 
gcaggcctgt 
ttacaatcta 
ccaaaatcag 
taatcactgc 
cattttcaaa 
gtatgatctg 
ataatgttag 
caagataaag 
aactgaagaa 
tactgcaggg 
actttattat 
canttataaa 
atagctggac 



gggctgagtg 
gacagttttg 
gaggtgcctg 
aacaagatct 
catcttcata 
ctggagacct 
cttcctagcc 
gcatttgatg 
gtggggaact 
agcatggtgc 
acaggtacaa 
actttggact 
ctggaagaaa 
ggcctgatat 
agagcttttg 
tcctttccta 
ctgaaagaag 
gcttatcagt 
aacagcctcc 
acaagcactc 
ggtgctttta 
ttcattttct 
tgtacatcac 
atgggaatct 
gaatttggca 
ttctcctcag 
gcaaaagata 
cttttggctt 
tattctgcat 
actgtaacgt 
aagctatact 
aagcatgtcg 
tcatttgcac 
ctgatatttt 
aagtttaaag 
tcagtttcca 
atgtactcac 
acaaagccag 
gcttcttgcc 
gattatgcag 
ggacgagcct 
ccaagagtta 
tgtttataga 
ctggtgtcac 
gaacaggtgc 
tttgaaacaa 
ttttttctga 
aacagctgtt 
aatatcttgc 
ccaaaaaagg 
gtnttcactt 
tctaaacctg 
attggtttca 



ctttgcagtc 
aaggacttgt 
tgcaccccct 
caagtatccc 
acaataaaat 
tagacttgaa 
ttaaagagct 
gtaatccact 
cagcatttca 
agcagttccc 
agataagcag 
tgtcttacaa 
tttctttaca 
ctctaaggat 
ccacacttgg 
cggaaggcct 
ccttagcagc 
gctgtgcatt 
aggaccacag 
ttgaaaatga 
agccctgtga 
tggttgcatt 
tgccttcgtc 
atactggcat 
tttggtggga 
aaagtgccat 
taatgaaaaa 
tcctaggtgc 
cacccctttg 
tagtgctatt 
gcaacttgga 
cttggctaat 
cattgatcac 
ttccattgcc 
aagactggaa 
tcagtagcca; 
atttgcaggg 
tatcatgcaa 
aaagacctga 
atgaagaaga 
gcttctacca 
aagactgaac 
gtgaacccta 
ttagaagaag 
ctaaattata 
atatatgact 
tccataagaa 
aatatttttt 
taattttacc 
gactgtccca 
gccatccttg 
gtagatgttt 



tttgcgttta 
tcagttacgg 
cagcaatctg 
tgactttgca 
tagaagcctg 
ttataataac 
aggatttcat 
cttaagaact 
caatttatct 
caatcttaca- 
catacctaat 
taatataaga 
gcgtaatcaa 
tctagatctg 
gccaataact 
gaatgggcta 
aaaagacttt 
ttggggttgt 
tgtggcacag 
agaacatagt 
atatttactg 
atttttcaac 
caaattgttt 
cctaactttt 
aactggcagt 
atttttatta 
tgggaagagc 
tacagtagca 
tttgccattt 
aaactcacta. 
aaaagaggacv 
cttcaccaat 
tgcaatctct 
tgcttgcctg 
gttactgaag- 
aggtggttgt 
caacctgact 
acacttgata 
gggctactgg 
ttcctttgtc 
gagtagagga 
tactgtgtgt 
ttctcatctt 
gagaggtggc 
aattggtgaa 
tgaaaaggat 
gcaaatttat 
aaaaatctat 
taatgtttca 
gccagaactg 
acataagang 
taaaccatat 



420 
480 
540 . . 
600 : 
660 " 
720 
780 
840 
900 

-960- v 
1020 \ 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280* 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3523 



<210> 28 

<211> 2179^ 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



P2925789CB1 



<400> 28 
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gtcctggcga 
tctcgctgcc 
ccaaggagga 
ggagcagaca 
agttaagata 
tcacagcttg 
gcgcacagaa 
aatgggtaca 
tcactggggt 
tgactctgtg 
gttgataaat 
gggctttgtc 
ttgtaaagtc 
atgtgtgatg 
tcatgcttcg 
catttttgtg 
agtgaagatc 
agctgcattt 
tgggatgttg 
atggattgaa 
ttttccatat 
ggtccctgaa 
aacaatagaa 
aatagaagat 
aagtccctgc 
aagaggttta 
tgtttcaaga 
tgaaacagat 
ttatccttta 
tgagcctact 
tatttttatt 
cctctttacc 
gcattttctg 
tccccccttg 
tatatgatat 
tgttacctgt 
ggtataaaaa 



gggcgctggc 
agggtttgct 
aaaaatctgt 
ccagggacca 
aagagctttg 
aaggcatgac 
aaaggaaatc 
ttctgttcgg 
cccatcatag 
ttgtggtatt 
tggactgtca 
cctctggggt 
tgccaagcat 
aagatggacc 
ttcacactgt 
atgactatgt 
gacatgagtg 
gctaccacct 
ttttttatcc 
gagaaggcta 
gatatgggaa 
ggagatggac 
cagttgaaac 
tatagtggtg 
accgaagagc 
cgatactggt 
ataaggggtt 
caagccccag 
agtctgctta 
ctggttaaga 
accattttga 
acctcttcct 
taggattttc 
ccaggagcat 
tctagatact 
cttttaaaaa 
aaaaaaaaa : 



cgagaggtgc 
ctctgcttgt 
tccgggtgag 
cactgaggtt 
ggtgcctgac 
aattaaagaa 
tataattctt 
ttatcaagtt 
cccttggtgt 
ggcccttaca 
tgattcttta 
ggaaaccgga 
acaaggcacc 
atcactgtcc 
ttctcctttt 
acacacagct 
cagcccggag 
. tgtttgcctt 
agatgaaaat 
aagatcgaat 
gtagatggag 
ttgagtggcc 
aaaaagcaga 
cctgctgccc 
ctcgaataca 
tatatggaga 
ggttccctag 
agggggagaa 
attacttgaa 
tgttcttttc 
aatggagaag 
ccccctcaaa 
ttatgcactg 
ctgcatgtgg 
atagaactca 
tgcagatttt 



tcggcttgta 
cctgggctga 
cccaggccgc 
gggtttcaga 
agtgaaaatg 
cacacatgga 
ttaaagtagg 
tgaaaatcta 
tatagcaata 
tacaactgga 
taattacttc 
aatttctcag 
acgttcacat 
ttggatcaac 
agcaccactg 
ttatcatcgg 
agatcctctt 
gggattagct 
aattctcaga 
tcagtattat 
gaactttaaa 
agtaagagaa 
taagagagtc 
tctgaataaa 
gctgcaaaaa 
caaaattctt 
aaaatgtgtg 
gaaaaataga 
aattgtacat 
ctcaaaggtg 
ccattctgca 
ggaaaaacat 
cacactctgg 
tacttctctt 
atttgtcaga 
gtcaaatcaa 



gcaggtcccg 
ggtgtccatg 
cccggatatg 
ccaagacact 
gtgtaatctg 
cttgtggcac 
aaggcattct 
caagaattaa 
tgttctacca 
ggaagtgtga 
aatgccatgt 
gataccatgt 
cactgcagaa 
aactgttgtg 
ggttgtatcc 
ctctcctttg 
ccaattgttc 
ttaggaacaa 
aacaaaactt 
caactagatg 
caggtattta 
ggctgtcacc 
agaagtgttc 
ggaatcaaaa 
ggggaattca 
gatgattcct 
gaaaagtgtc 
tagctgctgt 
attactaaag 
ccctagtgcc 
tatgcctttg 
ttcatccaag 
acctcacctg 
ttccctcagt 
ttcagtataa 
ataaagatca 



cactccagcc 
acggagtcat 
cgatggctga 
ggattctcct 
cgttaacagt 
atggaaatgt 
tccttgccaa 
agagactgtg 
tggccatgat 
atttcatcat 
ttgtcggtcc 
atctccagta 
agtgtaacag 
gttaccaaaa 
atgctgcttt 
ggtggaacac 
catttggatt 
ccatagctgt 
ctattgagtc 
aagtctttgt 
cgtggtcagg 
aatacagctt 
gctataaagt 
ccttcttcac 
ttttagccac 
ttatagaagg 
cctgtgatgc 
taaaacaaaa 
aattatgcaa 
atgatttaaa 
aattcctgcc 
taagttaacg 
cagatacagt 
tgatatttct 
cctcagattt 
atggatgttg 



<210> 29 . 

<211> 645 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3099990CB1 



<400> 29- 
cggccaaaga 
acacagcggc 
ggcgaggaga 
ctcccctaca 
aaggtcaagt 
.aagaatgcgc 
aagtgccagg 
gaccggcagg 
acggctgatg 
ccagggaccc 
cctcccaagg 



tctcagcctt 
gtggcatgaa 
ccctgcgtgt 
tcggccgcat 
ggttctacca 
tgtaccagtc 
tcgtggcgcg 
acctctacta 
gcgtgcccat 
tgtgtgcgga 
gcgcatctga 



cctgcccgcc 
ggggaaggcc 
cggggactgt 
cgagagcatg 
ccctgaggag 
ctgccacgag 
cgagcagtat 
cctggcgggc 
cctatgctga 
cctggcgtcg 
gcaaatatgc 



cggcagctct 
cggaagctgt 
gccgtcttcc 
tgggagtcgt 
accaagctgg 
gatgagaacg 
gagcagatgg 
acctacgacc 
gccgcccacc 
gccaagccac 
aaaagcccac 



ggaagtggtc 
tctacaaggc 
tgtcagctgg 

ggggcagcaa 

gcaagaggca 
acgtgcagac 
cccggagccg 
ccaccaccgg 
gcagatgcct 
cgggcaggag 
agggc 



ggggaatccc 

catcgtgcgg 
gcggcccaac 
catggtggtc 
gtgcgacggc 
catctcccac 
caagtgccag 
gcgcctggtg 
cccacgtgcg 
gcagccccgg 



<210> 30 

<211> 627 

<212> DNA 

<213> Homo sapiens 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620) 

1680: 

1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2179 



60 " 

120 

180 

240 

300 

360 

420 

480 

540 

600 

645 
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■ <220> .■■ / r : " > r'-y-; ^ / . 

<221> misc_feature 
<223> Incyte ID No: 103561CB1 



<400> 30 

cggactcaga 

aaatcacggg 

cttcccggcg 

ccgatcaatg 

actttgagtt 

gaccgttttc 

aacgtaaatc 

acatccagtt 

cctcgtctgc 

tttgtgacag 

ctctggaata 



agggacttcc 
gtctcccgcg 
ctttcctggg 
caatgtcccg 
tcecattggg 
tatcatctgc 
atgtcgtaat 
cggatcccaa 
cacatgcatc 
tgagttgaaa 
ataaaaatct 



ctgctcggct 
ccgctcatgg 
ttgcttctgg 
gaatggcttc 
acatatctga 
ctaaaaaact 
cctccagatc 
attaaatatt 
atctcaggca 
tatgcattcc 
taaccga 



ggctttcggt 
cgcctcccgt 
cggccctggt 
catttgccag 

^acta_tgaatg 
cagtctggac 
ctgtgaatgg 
cttgtcctaa 
acactgtcat 
tatttctttt 



ttctctgctc 
ccgtctcgag 
gttgctgctg 
gcctaccaac 
ccgccctggt 
aagtgctaag 
catggcacat 
aggataccga 
ttgggataat 
accgatacat 



acctccggat 60 
cgtccctttc, 120 
tcctccttct 180 
ctaactgatg 240 
tattccggaa^ 300 
gacaagtgca .3 60 
gtgatcaaag 420 
ctcattggtt 480 
aaaacacctg 540 
tctaattttt 600 
627 



<210> 31 

<211> 1858 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc.feature 

<223> Incyte ID No: 288709CB1 



<400> 31 

ctggatgagc 

gtgtggccag 

tttccaatgg, 

agaagpaagt. 

ggaatcaggc, 

cagggatggg 

ttctgcatga 

tgatctctgc 

ccttctccca 

gccagaatgg 

ccagcaccat 

atgtgcagga 

tgctgtgcct 

gcacctcact 

tggggattga 

atctctacct 

cacggaacct 

tcccagtcgg 

acctttatgg 

tccttggccc 

ggattttgaa 

ggatgctggc 

gcttgctaca 

gcctccaagg 

aatatcaaaa 

tttccagcaa 

tgaagagaaa 

ctctttggca 

tggagaggaa 

acaaatgtgt 

tggaaaccca 



tgctggaggc 

tcacctgctg 

gctgttgaac 

agacaggagt 

acagaaatct 

caagttgctg 

gacacaccag 

ctttctgagc 

ccgttcagtg 

atgtggtcac 

ctgccgttgc 

ggaggatccc 

cctcctggcg 

gcatctgcag 

tcgaactgaa 

ggccgccttc 

gacagtggtc 

ctatggcgtt 

aactgctgat 

agtctgtgcc 

aagaaaactt 

tttcaaagca 

ggtgggtcca 

cttcttcatc 

gtggtttaga 

gatgggtcct 

atattaaaac 

ttatgaagaa 

gtaatcaacc 

ggtaaattgc 

aagcatgacc 



cctggggacc - 

gatggcctag 

ttcagttatc 

gtcaccttga 

ggtgacccag 

gctgaggccc 

ggcttgctgc 

aacaacgaca 

atcccgagac 

tgggccacca^ 

acccacctga 

gtgctgactg 

gccctcactt 

ctctcgctct 

cccaaggtgc 

acctggatgc 

aactactcaa 

cccgctgtga 

cgatgctggc 

attttctctg 

tcctccctca 

acagctcagc 

gctgcccagg 

ttcttggtct 

gagatcgtaa 

gactcaaaac 

tagaatattc 

tgaagctaag 

tttacttccc 

atttctcttc 

actgcaaata 



tggagaccct 
aggatgtcct; 
ctgcaggcac 
gacagaatca 
gcccttctgt„ 
ctctggtcct 
aggacggctc 
cccaaaacct 
agaaggtgct , 
caggctgcag 
gcagctttgc .., 
tcatcaccta 
ttctcctgtg 
gcctcttcct 
tgtgctccat 
tgctggaggg 
gcatcaatag 
ctgtggccat 
tccacctgga 
cgaatttagt 
atagtgaagt 
tcttcatcct 
tcatggccta 
actgcctcct 
aatcaaaatc 
ccagtgaggg 
aactccatat 
gaaaagggaa 
aagctgtttg 
actatggtgt 
tttccttgac 



gccccgctta 
cagaggcctg 
agaattgtcc 
ggcagtgatg 
ggtgggcctt, 
ggaacctgag 
ccccatcctg 
cagctcccca . 
ctgtgtcttc 
cacaataggc 
cgtcctcatg 
catggggctg 
taaagccatc 
ggcccacctc 
catcgccggt 
tgtgcacctc 
actcatgaag 
ttctgcagcc 
ccagggattc 
attgtttatc 
gtcaaccatc 
gggctgcaca 
cctcttcacc 
cagccagcag 
tgagtctgag 
ggatgttttt. 
ggaaaatcat 
ttcattaaac 
ttctccacaa 
attcagtcaa 
tttttgtaaa 



cagcagcact 
agcaagaacc 
ctggaggtgc 
cagctcgact 
gtctccattc 
aagcagatgc 
ctctcagatg 
gttaccttca 
tgggagcatg 
accagagaca , 
gcccactacg 
agcgtctctc 
cagaacacca 
ctcttcctcg 
gctttgcact 
ttcctcactg 
tggatcatgt 
tcctggcctc 
atgtggagtt 
ttggtctttt 
cagaacacaa 
tggtgtctgg 
atcatcaaca 
gtccagaaac 
acatacacac 
ccaggacaag 
atccatggat 
atatcatcct 
taggctctca 
tgcttgtccc 
aaaaaaaa 



60 - 

120 

180 

240 

300 

360 

420 

480 

540 

600 - 

660 

720 

780 

840 

900 

960 

1020 4 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800. 

1858 



<210> 32 

<211> 539 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_ feature 

<223> Incyte ID No: 959893CB1 / 

<400> 32 

tttttggctg agaaggctgg gtctacattt eaggccacat ttggggagac gaatggagtc 60 
attcctggga ggtgttttgc tgattttgtg gcttcaagtg gactgggtga agagccaaaa 120 
gatagaacag " aattccgagg ccctgaacat tcaggagggt aaaacggcca ccctgacctg 180 
caactataca aactattccc cagcatactt acagtggtac* cgacaagatc caggaagagg 240 
ccctgttttc ttgctactca tacgtgaaaa tgagaaagaa aaaaggaaag aaagactgaa 300 
ggtcaccttt gataccaccc ttaaacagag tttgtttcat atcacagcct cccagcctgc 360 
agactcagct aactacctct gtgctctagg gggaagagga acaaactcac ctttgggaca 420 
ggcactcagc tagaagtggg actcaatatc cagaaccctg accctgccgt gtacaagctg 480 
agagactcta aatccagtga caagtctgtc tgcctattaa cggttttgat tctcaaaaa 539 

<210> 33 . . 

<211> 1126 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_ feature 

<223> Incyte ID No: 1414179CB1 

<400> 33 » 
tgccatctta ggggcgcctg • gcgctacggg tttctcgttg gaggcggcct tcgtggcagc 60 

tgtagacgcc gggaaaaggc ataaagtccg ttggccgaca cctttctttc ctccggcctc 120 r "^rc 

ggtagaaccg ccagcccgcg tccgaaggcg gaggcgaggg gaactggccg cgtgaggggc 180 ^ . : i*T s 
ctgaggcgag cggttagagc gtctcccgga aggatgggcc ggtctcggag ccggagctcg 240*. ^ 
tcccgctcca agcacaccaa gagcagcaag cacaacaaga agcgcagccg gtcccggtcg 300' r 
cgatcccggg acaaggagcg cgtgcggaag cgttccaaat ctcgggaaag taaacggaac 360" ^ 
cggcggcggg agtcgcggtc ccgttcgcgc tccaccaaca cggccgtgtc ccggcgcgag 420V #»'^ 
cgggaccggg.agcgcgcctc gtccccgccc gaccgcatcg acatcttcgg gcgcacggtg 480 ; 
agcaagcgca gcagcctgga cgagaagcag aagcgagagg aggaggagaa gaaagcggag 540- L \t 

ttcgagcggc agcgaaaaat tcgacagcaa gaaatagaag aaaaactcat cgaggaagaa 600 l ^ 
acagcacgaa gagtagaaga attggtagca aaaagggtgg aggaagaact ggagaaaagg 660 
aaggatgaaa ttgaacgaga agttctccga agggtggagg aagccaaacg catcatggaa 720 
aagcagttgc tcgaagaact cgagcgacag agacaagctg agcttgccgc acaaaaagct 780 
agagaggagg aagaacgtgc aaaacgtgag gagctagagc gaatactgga agagaataac 840 
cgaaaaattg cagaagcaca agccaaactg gccgaagaac agttgagaat tgttgaagaa 900 3 
caaagaaaga ttcatgagga aaggatgaaa ctagaacaag aacgacaacg tcaacaaaaa 960 
gaagaacaaa aaattatcct gggcaagggg aagtccaggc caaaactgtc cttctcatta 1020 
aaaacccagg attaaattgc aaactctgaa ctttttacaa agaaaaatgg aaaaactttg 1080 ■ - * 
tatggtagct tcatgttgaa gtggtttttt gtttttggtt ttgttt 1126 

<210> 34 \ 
<211> 490 
<212> DNA 
<213> Homo sapiens 

<220> \ . * • * 

<221> misc_f eature . " 
<223> Incyte ID No: 2197211CB1 

<400> 34 

gcctctgaga aaagaaggtt ggaattatcg taat ttgttt ctaggctgag ataccagcat 60 - 

ggagaaaatg ttggagtgtg cattcatagt cttgtggctt cagcttggct ggttgagtgg 120 ■ -\ 

agaagaccag gtgacgcaga gtcccgaggc cctgagactc caggagggag agagtagcag 180 

tctcaactgc agttacacag tcagcggttt aagagggctg ttctggtata ggcaagatcc 240 

tgggaaaggc cctgaattcc tcttcaccct gtattcagct ggggaagaaa aggagaaaga 300 

aaggctaaaa gccacattga caaagaagga aagctttctg cacatcacag cccctaaacc 360 

tgaagactca gccagttatc tctgtgctgt gcaggggggc attgggaatg tgctgcattg 420' ♦ 

cgggtccggc actcaagtgg ttgttttacc acatatccgg gaccctggac ctgccgtgta 480/ 
acagctgaga 490 
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<210> 35 
<211> 1799 
<212> DNA ■ ' 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2263653CB1 



<400> 35 

ctaggccggt 

gctggtggtg_ 

cgcgcgctgg 

gcttcgggcc 

ccttagtgag 

ccccctcacg 

ggactttgcfc 

gggaccagcc 

catcaagatg 

ctctgtctgc 

gcgggaggag 

agagccactt 

ccgcgtggga 

gcggctggcc 

gttcctcctg 

cattgcacgg 

cgattctgcc 

ggcggtgacc 

gctgcgaagg 

ctactgctat 

ctgcacactt 

actatccccc 

ccagcagact 

ccgtggcgtc 

tttcggcctc 

ggccctgagg 

cgtgtcccca 

acccgcggcc 

agcatgaggg 

gtgatgctgg 



gggtgagtgc 

accctgctca 

ctgctctgta 

ctggcgggga 

gagaagccac 

accgtggatg 

gtgtactcgg 

aaggagacta 

ttcctgacag 

ctcacctttg 

accctcgagc 

ctgaagaagc 

ctggcagtgg 

cagacccacc 

cacaccagct 

gacttcctgc 

ttcgactctg 

cggccccacc 

gaggctggcc 

gtgaccgtgg 

ctgctcaaga 

gacccatcct 

gcagcgcgga 

ctggcctacc 

tacttccacc 

tctgttcctg 

gctgcaaggt 

ccccttggac 

tcccgaggcc 

ctgctcttct 



accgcgttct 

ctgccaccct 

acggcagttt 

agccgaggcc 

tgtctgtgcc 

ccctggtcct 

gcggcgtgta 

acattgctgt 

tgacacggct 

ccttcctctt 

tgggcctgga 

agggctggga 

tgggctctgt 

gggacgcact 

tcctgtctcc 

accagccgcc 

ggcgcctctg 

tgcaggccta 

gcatcgaagc 

tgagcttgca 

cgctgggagg 

cagccagcgc 

ttgccggggc 

tcatctggtg 

agcacttggc 

gggcagcggg 

ggggccggac 

gccgagtttc 

attgtctccg 

gaacaaataa 



cgcacgcgtc 

catgcacagg 

gttccgatac 

cagaggcagg 

ccgagatgcc 

gcgcttcttc 

cctcttcaca 

gttctggtgc 

gtacttcagc 

cctgctgctg 

gcctggtctg 

ctgggcgctt 

gctgggtgcc 

gaccatgtcg 

cctgttcatc 

gtttggggag 

gttgctggtg 

cctgtgcctg, 

ccgtgaaatc 

gtacctgacg 

ctattcctgg. 

tgcccccatc 

tctgggtggc 

gacggctgcc 

aggctcctag 

acactagcct 

tccccggcgt 

tgcctcagaa 

aagcgtatgt 

aggagcatgc 



atggcggtcc 

ctggcgccac 

aagcacccgt 

aaagagcggt 

ccgttccagc 

ctggagtacc 

gaggcctact 

ctgctcacgg 

gccgaggagg 

gccatgctgg 

gccagcatga 

cctgtggcca 

ttcctcacct 

gaggacagac 

ctgtggctct 

acgcgtttct 

gtgctgtgcc 

gccaaggccc 

cagcagaggg 

ccgctcatcc 

ggcctgggcc 

ggctctgggg 

ctgcttactc 

tgccagctgc 

ctgcctgcag 

gccccctctg 

tcccttcacc 

ctgtctctcc 

gccaggtttg 

cgatttttaa 



tcggagtaca 

actgctcctt 

ctgaggagga 

gggccaatgg 

tggagacctg 

agtggtttgt 

actacatgct 

tgaccttctc 

ggggtgagcg 

tgcaagtggt 

cccagaactt 

agctggctat 

tcccaggcct 

ccatgctgca 

ggacaaagcc 

ccctgctgtc 

tgctgcggct 

gggtggagca 

tggtccgagt 

tcaccctcaa 

cagctcctct 

aggacgaagt 

ccctcttcct 

tcgccagcct 

accctcctgg 

tttgcgcccc 

acagtgcctg 

tgggcccagc 

agtggcgagg 

aaaaaaaaa 



60 

120 

180 

240 . 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1799 



<210> 36 
<211> 1435 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2504590CB1 



<400> 36 

catagcgcga 

gcctgcctca 

aagatccttt 

gcgcgatctc 

gctcccgagt 

tttcagtaga 

tgatccgccc 

gccctcaaag 

tgcaaacgcc 

atgatgctgt 

ttactgcttt 

ttttaccctt 

tgtttaacct 



ggttcaccat 
gcctcccaaa 
tttttgttat 
aactcactgc 
agctgggact 
gacgggatat 
tcctcagtct 
attgttatga 
cggaagactt 
ggatgagcat 
gccctgctga 
cctaggacct 
ccttgtaaag 



gttggccagg 
gtgctgggac 
cagatggagt 
aagctctgcc 
acaggcgccc 
caccatgtta 
cccaaagtat 
gtcctcacag 
aaagctattg 
gaaagttttt 
tggaattcag 
ttaatagata 
tttgtgtctt 



ctggtcttga 
tacaggcatg 
ctcactgtgt 
tcccgggttc 
gccaccactc 
gccaggatgg 
taggattaca 
atgtcagagg 
ttttgcaaat 
ctcttactcc 
taaaggaatc 
tctttttatt 
ccagattaca 



actcctgtac 
agccactatg 
ccccaggctg 
acgccattct 
ctggctaatt 
tctcgatctg 
ggcgtgagcc 
acgatgctgt 
gcacagataa 
taattataat 
tctaccagca 
actaatcttt 
ataattccat 



cttgtgaaat 
cccaggctca 
gagtgcagtg 
cctgcctcag 
ttgtttgtat 
cctgacctcg 
accgcccccg 
gacctatatt 
caataataac 
gttttccctc 
gacacttggt 
gctctttgct 
gtaaagatga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720„ 

780 
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tgctggcaca 
ctttgagcct 
atgcccataa 
ccttaagatt 
gacttatttt 
gtggagaagc 
gcttatgagc 
cacaccttgg 
cccagaaagt 
acaactagct 
ctctgcaaga 



aggctttcaa 
tttagaacag 
catcagcagg 
aaggaggagt 
ccattcccaa 
ctgctgaaac 
ataaagacac 
caatggcctg 
tctgaataac 
agtagcccac 
agtagcacca 



cccatcccct 
gtatccaggg 
aagcagttac 
atataatctc 
ccccattgaa 
cagcagatga 
taccactggg 
gaagttactt 
ctacctctta 
aggcaccaac 
gttcaataaa 



cttctgaccc 
attttacctc 
agaagatgaa 
tgatggggaa 
cagagcagga 
tgatgaaagt 
accatggcca 
tatatggttc 
attggcatgc 
tctgggcaca 
agttgctttc 



agaagataaa 
tccagtgcta 
cctccgccct 
atgaggtagg 
tctggtcaaa 
gacctctagt 
gtttacaaat 
tggaaactcc 
aattaaaagt 
ctgcctatag 
tctcaaaaaa 



gacatcctac 
ggcagggtct 
tctgcaagcc 
agaccagaag 
acagggtgca 
tgccctcact 
gtcatggcaa 
ctgccccttt 
ggctctaaat 
gttagccctg 
aaaaa 



<210> 37 

<211> 1792 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2529619CB1 



<400> 37 

acacgcgtcc 

atcaacttgc 

atcaattgtt 

ctcagttctt 

tctgactcca 

caaacaggac 

cagacaatgg 

ctcctctttg 

ctggggctca 

ctggccgtcg 

ctgcatccag 

agttttgcac 

atctgccacc 

atcacttcct 

ctgccctttt 

aggctggcct 

atcctggtgg 

ctgaggatcc 

tgcgtagtgg 

catcctgagg 

aaccccctga 

caagaaaagt 

actcttgatg 

tactgacaga 

ttcattgggt 

actttaatat 

ggaaaccatc 

tcccctttct 

tctgctcata 

ccattagttt 



ggagctaacc 
tacactgtgt 
atcaacctct 
catggtccac 
taagacaact 
tactgttctg 
tcacagagtt 
ggctcttctc 
tctcactgga 
tcaacatcgc 
ccaagcccat 
atactgaatg 
etctccgata 
ggacatgtgg 
gtgggcctcg 
gtgctgatac 
gaccactctg 
agtctgggga 
gactcttctt 
agcagcagaa 
tttacaacct 
cattcctaag 
cccaattatt 
agggcaagtc 
ttctatgaga 
ctttgaccag 
ttggagaggg 
tggagggtca 
catctgtcat 
gtatagagct 



aggcacatct 
tcctccagtg 
agcagattat 
gatagttctt 
ttgcaaagtc 
aaagaatact 
cctcctactg 
cctgttctat 
ctccagactc 
ctatgcctgc 
ctcctttgct 
cctcctgttg 
tttcatcatc 
ctccctcctg 
tgaaatcaac 
ctggctcaac 
cctggtgctg 
gggccgcaga 
tggcagcgcc 
ggtccttttt 
gaggaatgta 
aggtgtgaca 
gcctcaatcc 
ttctctcgtt 
tgtggtttta 
caaatatctc 
tcctgtcttc 
cttggaacca 
gtaacacttt 
gacttagtat 



tcacaatgaa 
cccagttcac 
tgaatagaeg 
atatccagaa 
ttggctcata 
cagactatga 
ggatttctcc 
gtcttcaccc 
cacaccccca 
aacacagtgc 
ggctgcatga 
gtgctgatgt 
atgacctgga 
gctatggtcc 
cacttcttct 
caggtggtca 
gtctcctact 
aaggccttct 
atcgtcatgt 
ctattttaca 
gaggtcaggt 
tttgaactgc 
agaaaagttt 
ttttgcagat 
tcagacaatt 
caccctccag 
ccctgaggtg 
gctaactttt 
agtgttctat 
ttttcggacg 



atcttcacaa 
aggaagcact 
gtcgttacat 
aacgtactta 
actataaatc 
gtagtgaaat 
tgggcccaag 
tgctggggaa 
tgtacttctt 
cccagatgct 
caactacctt 
cctacgatcg 
aagtctgcat 
atgtgagcct 
gtgaaatcct 
tctttgcagc 
cacacatcct 
ccacctgctc 
acatggcccc 
gttctttcaa 
gccctgagga 
cagcctcagt 
acttctcttt 
aaaattttag 
ttttctttta 
ggagaggtag 
ggctctgaat 
cagggtcctt 
atttgcatag 
cagttggtcg 



gctcagaacc 
caagagtagg 
gaaactgaca 
cgttatcaac 
aaggctgaat 
ggtgaaaaat 
gattcagatg 
tgggaccatc 
cctctcacac 
ggtgaacctc 
tctctttttg 
gtacgtggcc 
cactctggcc 
catcctaaga 
gtctgtcctc 
ctgcatgttc 
ggcggccatc 
ctcccacctc 
taagtcccgc 
cccgatgcta 
gagcactgtg 
tgtcacgtgg 
atctgtgctt 
atgtgttgca 
tttcacaatt 
tagcttctag 
ccaigcactct 
tcttcctagt 
ctgtatcctt 
at 



<210> 38 

<211> 1817 

<212> DNA 

<213> Homo sapiens 

<220> ■ 

<221> misc__feature 
<223> Incyte ID No: 



5467661CB1 



840 

900 

960 

1020 

1080 

1140 

1200 

12 60 

1320 

1380 

1435 



60 

120 

180 

240 

300 : 

360 

420 

480; 

540. 

600 

660 : 

720 

780' 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1792 



<400> 38 ■ ^ > . > ■ " . r. 

gacggtccgg ccgcggccca ggccgggagg tggagcgcgg ctgccgccag ctggtccact 60 
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gacaactgct 
gtcttcgatt 
cgtcttgtct 
ggacaccctg 
accaaacatc 
cagggtccgg 
gctctggaag 
caccttctac 
7 catgacggtg,. 
ccgcttcttc 
cctggagagt 
ctactctgtc 
ggactttaat 
cttcctggtc 
cctgccttct 
gcaggggctg 
tgccacaacc 
gggcttCttC 

agaggagcca 
ggctgcaggg 
cggcggggtg 
cagcacagac 
ggaggacagg 
caagggacgt 
ccccacctcc 
tccccttccc 
catgcttggc 
ggctggctct 
caccatggtg 
aaaggagcaa 



gccagctgga 

ccgcccgctt ■ 

ctgcacctcc 

gaggaggtga 

agtgtgcctc ; 

tactgggacc 

cttecatctg 

atcctggtgt 

agcacctcga 

ctgctggcca 

aagtccagca 

acccagggga 

atctatggcc 

tactctctgg 

cggaggagct 

gggagtgtgc 

ttcctgtact 

ggctcggagc 

gatgtacacc 

gctgctgggg 

gcctacctgg 

agcgagcgct 

ccagagagga 

tctgtgggca 

ctggggctcc 

acctgctctc 

aaggttgget 

cccaatgect 

gatggactga 

aaaaaaa 



aggcgcgtag 

tccgagtgag 

tctgtgctgt . 

cttgggccaa . 

atcgctgcct 

tcttgctgct 

ctcgggcgaa . 

ttgtggtggc 

acgctgc_aac_ 

tcgagctgag 

tcaagcgggt 

ccctggagat 

atgggggccg 

tggtcatcct 

tctacgtgta 

tgctgtgctt 

tcagcttctt 

ccaagatcct 

taccccagcc 

cctcagctgc 

atgacatcgc 

ggaaggccat 

ggccagcagg 

gtagccctgt 

caatcccctt 

atactgctca 

gagggcaccc 

ccatcccatc 

agtgtgtata 



ggcgagaggt 

aaacgcaggc 

ctctttaaga 

tgggagcaca 

■gctgctgctc 

catccccaat 

gatccgcatc 

gctggtgggc 

tgttgctgat 

tgtgatcatc 

gctggccatc 

cctgtaccct 

ccagttctgg 

tcccaagacc 

tgcgggcatc 

cgacatcatc 

cgctccgctc 

cttctcctac 

ctacgctgtg 

cagctactcg 

ttccatgccc 

caatgcctga 

cccagagtcc 

gtggccctgt 

tgccatctct 

gtgacatggc 

tccttctctg 

cccatggtgc 

ttttcttgat 



ggcgcgtcct 
ctctctcttc 
gggctggctg 
gcgctacccc 
tacgaagaca 
gtgctcttcc 
acctccagcc 
attgcccggg 
aagatcctgt 
ctgggcctgg" 
accacagtgc 
gatgcccatc 
ctggtcagct 
ccgctgaagg 
ctggcactgc 
gaggggctct 
atctacgtgg 
aaatgccaag 
gcccgacggg 
agcacgcagt 
tgccacactg 
gggcagetgc 
ccaggggagg 
tcccaccatg 
gctctcactg 
ccaggctttc 
t cacccttggc 
tttggcctcc 
-.. ctatttttta 



cagctttcca/ 

cgtgaagcag 

ctggcgggat 

cacccctggc 

ttggcacctc 

tcatcttcct . 

ccatttttat 

ccgtggtatc 

gggagatcac 

cctttggcca^ 

tgtccctggc 

tctcagctga 

cctgcttctt 

agcgcatctc 

tcaacctact 

gctgtgtaga 

ctttcctccg 

tggacgagac 

agggcctgga 

tcgactctgc 

gcagcatcaa 

cagggcctgt 

aggaccaggt 

agtctggagg 

gggaccctcc 

cttccagggc 

acgagggcag 

tcaaagcatc 

ataaaaagga 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660- r 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
;1800 
1817 



<210> 39,;, 

<211> 1820 

<212> DNA - 

<213> Homo sapiens., 

<220> *• 

<221> mlsc^f eature, 
<223> Incyte ID No:; 



229740CB1 



<400> 39 

ctttgagatg 

cacccgagcg 

caagcaggcg 

cagtgtggac 

caacacgtca 

ggggttttcc 

agctgccaag 

ccaagcaticc 

cacacacctg 

cgtgaagaag 

agcactgcag 

tgatgagaat 

caacagagag 

gctccgagaa 

gctcatgtca 

aaacctggaa 

gcgccacctg 

cctgcatgac 

gcccaggccc 

cttctgtgtg 

tgecgtccac 

gttgtggaca 

tatgcagtca 



ctgaagatga 

catgtcttcc 

gtcaccgttt 

ggagccaagg 

cagacgtctg 

tctgagcagc 

atagccaaag 

agtgtcaacg 

aagtctgaga 

tgggagatcg 

gagtcggcag 

gaccagctcc 

aaggagaaga 

aaggagacag 

gagtgcgaat 

gacaaagtgc 

aaggtggagt 

ttccgccgag 

cgcccgtgag 

cattgcttct 

tcactccttt 

actggaagat 

gggcagggat 



tggaagtcta 

agattgaccc 

cctacttcta 

tgatcataaa 

ggcagtgggc 

agctgacaaa 

acaagacgca 

ggacggacga 

atgacaagct 

agctgcagac 

ccagtgtgga 

gcaacaagat 

acacgcagct 

agctgaaaga 

atgtctctga 

gttccttaaa 

tgaagagctt 

ggctctccaa 

tcccaagcgt 

gtaaatgcag 

tcagaataga 

tctgactcag 

gtttatatct 



taaggaacct 

caacaccaag 

tgatgtcaca 

cagcacaatc 

cgacagcaga 

gtttgcagag 

ggagaaaatc 

tgaaaaggcc 

gaagattgcc 

ccttcgggag 

gcagtggaag 

tgatgagctg 

gaagaggagg 

tctccgaaaa 

gaagctagag 

gacagacatt 

cctggaggtg 

gctgggcacc 

gtgtgcgaga 

gcgcagtttg 

aatctcctct 

gaatccagaa 

ttcataaggg 



agagaacagc 

aagaactgga 

aggaacagct 

acaccgaata 

gccaacacag 

aaattccagg 

gagacctcaa 

tctcacgccg 

ttgacgcaga 

agcaatgcac 

aggcagttct 

gaagaacaat 

atcgaggagc 

caaagtgaaa 

gcggcagaga 

gaggagagca 

ctggacggga 

gataactagg 

ccagatagct 

tcgtgtttcc 

cgcttctctg 

ctaggtctac 

ctgttgcaac 



ccatcttcac 

tgcctgcgag . 

atcggatcat 

tgaccttcac 

tgtttggttt 

aggtgaaaga 

gtaatcattc 

gtccagccaa 

gcgcagccaa 

ggctgaccac 

ccatctgcca 

gcagtgagat 

tggaggcaga 

tcatacctca 

gagacaatca 

aataccgaca 

agattgacga 

gctggccgag 

ctaggacgtt 

aaaccagttg 

gccttgtgag 

cttcaacatt 

catatgaact 



60 

120 

180 ..... 

240 

300 

360 

420 

480 

540 . 

600 
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780 
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gaaaaaacac 
cttttatcaa 
cagagagaac 
aaactttctg 
ctgcatccac 
ttccttttct 
agggtgccta 
atgtgaaaaa 



gcattttgta 
atagcattca 
aacattcctc 
agagatggaa 
gctactgccc 
tggttttaat 
gcacaatgtc 
aaaaaaaaaa 



atccaaatat 
gagtatttga 
tctgtcaaca 
aacaagcttc 
cctgcccccc 
ttctgagcag 
tggcacaaag 



tgatattctt 
atgtccacca 
tcgagaggct 
tggtgggtgc 
gcccccccag 
atgattgtgc 
taggtgctta 



tacaccaagc 
gacaccagcc 
ttaaaacaac 
attttctggc 
tttgtacggt 
tgtgggaaca 
ataaatattt 



catcaggctc 
ccggggggca 
tgtttagtgg 
ccggagttgc 
tgcaacagtg 
gcacacagtg 
gttcaattaa 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1820 



<210> 40 

<211> 1620 

<212> DNA : 

<213> Homo sapiens 

<220> 

<221> misc_ feature 
<223> Incyte ID No: 



1317467CB1 



<400> 40 

tccccgccat 

tccgccccgc 

cagcccgtgc 

ctcttctact 

ggttacctaa 

aatctacagt 

taccaaatgt 

gcagccacca 

ctggtcaata 

gtctccttta 

ctgaaactgc 

aggacccagg 

ttgtcttact 

atcatcgtca 

actgtgacct 

aatctaaaac 

ggatcgaagg 

tacattagag 

ctggccctca 

ttgcagctgg^ 

gatgaagaag 

tgtaggactt 

gagcgcaggg 

ccaagaaaca 

tgaagaagga 

gagaattcgt 

agctgtttct 



gtgacgccgt 
cgcgcagcac 
agcatcccgg 
gcactgtcct 
ccaaactcct 
gccctcggca 
gtagttccca 
ccttccagaa 
gccgtgtttt 
aatgccaacc 
actgccatga 
aaagggacat 
cagctttgca 
acaatcacca 
acgcatgtgt 
cttctttgaa 
ttctgaggaa 
cccacccgga 
cactgtgcgc 
ggagggagca 
aggaggagga 
catatcctat 
agcaaatcat 
tgggccagtt 
aggatcccaa 
catgtcattc 
agcacattcc 



ccttagccct 
"gatgcttctg 
cctccgccgg 
ggtctgctcc 
gcaaaaccac 
ttctacgata 
gaagcctgcc 
ggtgctggac 
tggacctgac 
taatgaatta 
atccaagttc 
ctgctcctcc 
agtcctatcc 
ttttggaagc 
tcccaagaac 
gcagaaagat 
agatggaatt 
gagagctgcc 
cctggtcatc 
gctggtgcca 
cccctctgag 
atacagttcc 
tcaggaaata 
ctactgaaaa 
atgcccctcc 
aacactcgtg 
aaaataaatg 



gcgaccccca 
ccgggacgcg 
caggtagagc 
aaagagatct 
accacctatg 
agtgtccaat 
tcccagaggg 
gaatgccaga 
ctttgtccag 
aaaaacaaaa 
ctcaacatct 
aaggcagagc 
cgaaggtgct 
ccctgtttgc 
atactcacag 
ggtgaatatg 
cttgttagca 
ctgctgttcg 
agagagtcct 
ggaagtgaca 
tctgatttcc 
atagaagctg 
tggatgaaca 
ccacatgcat 
agttctggtt 
aggccaggaa 
aggagggaag 



gcgcgtcccg 
cacgccaacc 
cgccggggca 
cagcgctcac 
cctgtgatgg 
cggcatttta 
aagacagctt 
accagcgggc 
gaagcagtaa 
ccgtgtgtga 
actctgcgac 
ggctcccccc 
atgggaagca 
caggcgtgaa 
cgattgatcc 
gtataaactt 
actctctggc 
tgtccagtgt 
gtgccaagga 
aggtcgagga 
caggggaact 
cagagctcgc 
gtggtttgga 
cttgatgcga 
cacctgtacc 
gctattaaag 
agtctttgtt 



ggcctgcgcc 
gccgacgccc 
gctcctgcgc 
cgacttctct 
ggactatttg 
tgggcaagat 
aacctgtgtg 
ctgccacctc 
atacctccta 
agaccaggag 
ctacggcagg 
tttcgattgc 
gagatgcaaa 
aaaatacctc 
agccattgct 
cgacccaagc 
agcctttgct 
ctgcatcggc 
cttccgcgac 
ggacagcgag 
gtcggggttc 
agaaaggatt 
cacctcgctc 
tcgcactttc 
ttctatgaag 
ggatgtttca 
ttctgtaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780/ 

840: 

900 

960. 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 



<210> 41 
<211> 974 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc^feature 
<223> Incyte ID No: 



2279267CB1 



<400> 41 

atgggggaca 

aggattcaga 

aacgggacca 

ttcctctcac 

ctggtgaacc 

tttctgtttt 

ctgtacgtgg 



atataacatc 
tgctcctctt 
tactggggct 
acctggcggt 
tcctgcatcc 
ccacttttgc 
ccatctgcca 



catcacagag 
tgggctcttc 
catctcactg 
cgtcgacatc 
agccaagccc 
tgtcacagaa 
ccccctccga 



ttcctcctac 
tccctgttct 
gactccagac 
gcctacgcct 
atctcctttg 
tgtctcctcc 
tatttggcca 



tgggatttcc 
acgtcttcac 
tgcacgcccc 
gcaacacggt 
cgggccgcat 
tggtggtgat 
tcatgacctg 



cgttggccca 60 
cctgctgggg 120 
catgtacttc 180 
gccccggatg 240 
gatgcagacc 300 
gtcctatgat 360 
gagagtctgc 420 
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atcaccctcg 
ttacttctac 
.ttggctgttc 
; ggagcaattt 
ctctgtgcta 
ttctcccacc 
eccagatatg 
aatcccatgc 
aagagagtgc 
tgacctagga 



cggtgacttc 
ctttaccctt 
tcaaacttgc 
ctgggctggt 
tccttcagat 
tctgtgtgat 
ggaaccccaa 
tcaatcccct 
tgggagtaga 
agtt 



ctggaccact; 
ctgtaggccc 
ctgtgcagat 
gggacccttg 
ccaatcaagg 
tggactctct 
ggagcagaag 
tatctgtagt 
aagggcttta 



ggagtccttt 
cagaaaattt 
acccacatca 
tccacaattg 
gaagttcaga 
tatggcacag 
aaatatctcc 
cttaggaact 
tgaaaaggat 



tatccttgat, 

atcacttttt 

atgagaacat 

tagtttcata 

ggaaagcctt 

ccattatcat 

tgctgtttca 

cagaagtgaa 

tatggcattg 



tcatcttgtg 
ttgtgaaatc 
ggtcttggcc 
tatgtgcatc 
ctgcacctgc 
gtatgttgga 
cagcctcttt 
gaatactttg 
tgactgacag 



480 
540 
600 
660 
720 
780 
840 
900 
960 
974 



<210> 42 

<211> 1561 

<212> DNA ; 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



2436258CB1 



<400> 42 

gttaaaacag 

gcgcggagga 

ttcacgcggt 

tggtgttcct 

cagcccagct 

gtgcaagtgt 

ctggttcaga 

tgctcctggg 

cgctcctccg 

gacttataaC; 

acctgcagtt 

cacctgtgtg 

tggatgctat 

tggccgcctg 

ttggggaggc 

aggctgtcgg 

accaccgcca 

aggagcggca 

cccgacccct 

cctccaaatt 

tggtagtgac 

acccacccaa 

tgttcactgg 

ttgtttttta 

ttgtctccct 

ccgacagggg 

c 



ggatgtgcag 
cttggaggac 
cccgccggat 
acaacagggc 
gcggggggag 
cgcccgcatc 
gcgcacactg 
tgccgtgcgc 
acgttgcaca 
agcagcccag 
gctgctcgac 
cctgccccgc 
gcctgtctgc 
ccggcgcctg 
tgccagcttc 
ggcgccgggc 
actgccccag 
gcggctgtcg 
gcgcctcatt 
cgagctctat 
caaactcctg 
gtactccaca 
actctgatag 
' ccttctgtct 
aagcaatggg 
agcaggctgc 



atggaggtcg 

acgcagttcc 

cccgaagacg 

ccactgttgc 

ctgctagctg 

ttegcacaca 

gaccgacttc 

tgtgtgcccc 

gcgcctggcc 

gagcggaatg/ 

tgggtgggtg 

ttcaaccctg 

ctgctgctgc 

gttgaagatg 

tctaatgcct 

ctccggcact 

tttaccagcc 

gacctgtacc 

taccacgtgg 

acctgcctca 

cgctgggtga 

ccaccagcca 

ttggagctcc 

accctggaaa 

gcaaggtctg 

tttccctgcc 



gaggagacac 

ccagtgagga 

agggcctgga 

tcgtggccat 

tgcacgcaca 

agcagaacta 

tggacagtat/ 

ttgcccgccc 

tggcgctgtc 

tg.ctggccga 

caccagcctt 

atggtttttt 

ttggcaccca 

ggatgcatgc 

catcagccag 

tcctgtataa 

ctgagctaga 

accgcctgca 

ctgagaagga 

gccctctggt 

agaaagagga 

cctctacgga 

cagaccaggc 

tgtgtgtggg 

agggcccacc 

cagtcatgca 



tgctgccccg 

agctagagaa 

ggaaacagag 

gtcacggact 

gatcgtgagc 

tgacctccgc 

ggagcaggac- 

gctgcgagac 

agtgctggca 

gtgccggctg 

tgcggcgggt 

ctacgcctac 

acgtgaagcc 

ccttggtgcc 

tgctcctgcc 

gccgctggac 

ggccccctac 

tgctcgtctc 

gacactactg 

gaccaaggca 

ggaccggctc 

ccaagctgcc 

agtgctggga 

ggtgtgtctg 

gatgagagag 

ccttcccctc 



gcccccgggg 

ggtggagggg 

gaccacaagc 

tctcagtcag 

acacttacac 

cgcctgctgg 

ccaggagccc 

gcactaggtg 

gtaggcggtc 

gacccagctg 

gaggcttggg 

gtggcccgcc 

ttccatgcca 

atgcgtgccc 

tacagcgtgc 

atccctgacc 

agcagagagg 

cacagcacct 

gcctgggtga 

ggtgcaatct 

ttcattcgtt 

cataatggct 

gcaaccacct 

tggccagtca 

atggtggcag 

tggggaaatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

,600 

660 

720 

780 
840 
900 
960 
1020 

, 1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1561 



<210> 43 

<211> 1619 

<212> DNA 

<213> Homo sapiens 

<220> , 

<221> misc_feature 
<223> Incyte ID No: 



2681738CB1 



<400> 43 

tgcgagggcg 

tcttgcgcgc 

tggctacaac 

gtgataaggc 

cctcggtggc 



ggatagctgt 
gggaagcagc 
agtccctgat 
tgaagcttgg 
cttcatgctc 



ccaaggtctc 
accaagttca 
ggttgccgca 
ggcatcgtcc 
actctcccga 



ccccagcact 
cggccaacg.c 
atggcctgaa 
tagaaacggt 
tcctcgtctg 



gaggagctcg 
cttggcacta 
atccaagtac 
,ggccacagcc 
caaggtgcag 



cctgctgccc 60 
gggtccagaa .120 
tacagacttt 180 
ggggttgtga .240 
gactccaaca 300 
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ggcgaaaaat 
tcaccttcgc 
ttgggatcct 
agctcgtccg 
tcagcctagt 
acgtcaatgt 
tcacctacgt 
gttccttcac 
ttgccatctg 
atgacaccat 
ttagtcccga 
atgctttctg 
ctcaagagga 
ccacacattt 
acgcttggcc 
ctgaagagtg 
gcgaaatctt 
tcccagcctc 
tcttagaggc 
agtgggagtc 
aggatcttgc 
tcgaccacct 



gctgcctact 
cttcatcatc 
cttttccatc 
ggggaggaag 
ccaggatgtt 
cttttctgag 
cctcttcttg 
gggctggaag 
ggtggcctgg 
cctcagctcc 
gttttggctg 
taaacctcaa 
aatcactcaa 
tcagctgcag 
gagcccttac 
ggacaaatgc 
gagtcttctg 
aaccacaatt 
gctgtagtat 
tcaggcaact 
tctgtcaccc 
gtgctcaagc 



cagtttctct 
ggactggacg 
tgcttctcct 
cccctttccc 
atcgctattg 
ctttccgctc 
atggcgctga 
-agacatgggg 
atcaccctgc 
gccttggctg 
ctcacaaagc 
ctcgtgaaga 
ggttttgaag 
aaccagcctc 
aaagactatg 
agccgggcgg 
agaaaactgt 
cttccatgct 
tttttttttt 
caagtttaga 
aggcttgagt 
aatcctccca 



tcctcctggg 

ggagcacagg 

gcctgctggc 

tgttggtgat , 

aatatattgt 

ctcgtcgcaa 

cot tec teat 

cccacatcta 

tcatgcttcc 

ccaatggctg 

aaegaaaccc 

agagctatgg 

agacagggga 

cccaaaagga 

aagtaaagaa 

cagatctagc 

acaagacact 

ggggctgatg 

ttttggctca 

cccttactct 

gcagtggtgc 

tctccatctc 



tgtgttgggc 
gcccacacgc 
tcatgctgtc 
tctgrggtctg 
cctgaccatg 
tgaagacttt 
gtcctccttc 
cctcacgatg 
tgactttgac 
ggtgttcctg 
catggattat 
tgtggagaac 
cacgctctat 
attctccatc 
agagggcagc 
gggagctcaa 
aegggaacag 
tgggctagta 
tccttaggat 
ttttgtttgt 
gatcacagcc 
ecaaagtget 



atetttggee 
ttcttcctct 
agtctgacca 
gccgtgggct 
aataggacca 
gtcctcctgc 
accttctgtg 
ctcctctcca 
cgcaggtggg 
ttggcttatg 
cctgttgagg 
agagectact 
gccccctatt 
ccacgggccc 
taactctgtc 
agggatgtgg 
tttgcctccc 
agactccagt 
acttctttta 
tttttgaaac 
cagtgcagcc 
gggatgaca 



<210> 44 
<211> 3691 
<212> DNA " ;, 
<213> Homo sapiens/ 

<220> 

<221> misc_f eatiire 

<223> Ihcyte ID No: 2859482CB1 



360 

420 

480 

540 

600 

660 

720 V 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1619 



<400> 44 

ggegagegge 

gcaagtcttg 

atcaaccata 

tcttgagtta 

tctactctta 

acaaataatg 

cagcagctgc 

ggggaaaagg 

aatcagctct 

gagatgagee 

ccagtggccg 

gtaaatgetc 

tgtcagattt 

ctcttagatc 

tgtgtcagag 

tacaatgeag 

eggatacttt 

gccactgctg 

ataaaagaag 

tgctttaaac 

aataaagaac 

cgaaatagtg 

tcgtatgtta 

gaaactccta 

gtacaggagc 

agattattct 

cagataaagc 

gaaagtgaat 

cagtctggat 

aatgatgaac 

cttgaaacat 



gggcggcgga 

aacacattag 

atattctgag 

tggatgacaa 

atagtgaaga 

gaagtgatga 

aaaaagagct 

agcttgtgaa 

ccgcttcatc 

taccagtgaa 

atgaggacag 

ccaggagtga 

cactagatgt 

ctcagacaaa 

ggcatgatgg 

agctcttcag 

acagttttgc 

caccccagac 

atgatggtaa 

tacgecaagg 

ttgccattga 

acatgeaett 

ttaeggggag 

aagataaagt 

ctgttcgatt 

ggcccttcag 

aaagggagag 

cagaaagaga 

cgcaaagttc 

ctctcctgag 

ggggagaact 



ggaggagacg 

tttatacaat 

gcattaaaaa 

ggcttctgtt 

ttttgtcttg 

tgagaaaaca 

agcagatgta 

aaggtcacaa 

caccattaac 

acctggacaa 

cgtagttttc 

agtggaagcc 

taccctttca 

cactgaaata 

aactcctgag 

aatacacgtc 

cactgccttc 

tcctgacagt 

aggttatttt 

aattgataag 

aaggtgtttt 

attagatttg 

ctggaatcca 

cctgtttatg 

tctcctggag 

caaaegtagt 

aaagaataat 

gaggaggaaa 

agtgatacct 

tggatctggt 

gttgtcaaaa 



geaggtegga 
gtggcctgaa 
atatttaatc 
ggaaaaatca 
gtttccaggc 
ggactcaaga 
ctgatggatc 
ctggatggtg 
cctgtgccat 
ggagattctg 
agtaaactga 
ttaaggatga 
gtgccgaatg 
gcaaactacc 
agtgactgtt 
ttccggtgtg 
cgccgttctg 
gacatcttta 
agtgcagttc 
aagattgtca 
ggtcttctcc 
gaatctatgg 
aaatccccac 
accacagctg 
acaaaagtcc 
actactgaaa 
actgacactt 
actacagcca 
tctcctccag 
gatgtatcca 
tggcatctca 



ctgaecaaat 
agtttccctc 
attcatgtgt 
gtgtctcttc 
aaggagatga 
ttgtagggaa 
ctccaatgga 
aaggagatgg 
tagtagggct 
aagctteaag 
cttacttagg 
tgtccatctt 
tgtctgaagg 
ctatetacaa 
ttgetttcac 
aaataeaaga 
ccaagcagac 
ccttctctgt 
ccaaagataa 
tctatgtgca 
ttagtccagg 
gcaaaagttc 
attttcaagt 
tagatttggt 
gcgtttgctc 
atttcttttt 
. tatatgaagt 
gtccttcagt 
aagatgatga 
aagaatgtgc 
acttgaatgt 



atggaagtgt 
ctctttccaa 
tgagactcat 
agactcagta 
gacaccatct 
tggaagtgaa 
cgaccagcca 
gcctctttct 
ccaaaaacca 
tcctttcaca 
ctgtgcctcg 
aagaagccag 
aattgtgaga 
aatcctcttc 
tgaaagtcat 
agctgtaagc 
cccactttca 
gtctttagaa 
ggacagacag 
gcaaacaact 
aaaagatgta 
agatggaaag 
tgtaaatgaa 
aataacagaa 
aectaatgaa 
gaaactaaaa 
tgtatgcttg 
tcgcctgcca 
agaggaagat 
agaaaaaatt 
gagaccgaag 



60 

120 

180 

240 

300 

360^ 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740, 

1800 

1860 
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cagttgtcat ccttagtaag aaacggtgtc cctgaagctc ttcgaggaga ^tc^ggcag 1920 
ctgctagcag gctgtcataa caatgaccac ctggtagaga aataccgcat tcttatcaca 1980 
aaggag?c?c cccaggacag tgctatcacc cgggatatta accgaacatt cccagcccat 2040 
ga??a?ttca agga?acagg aggagatgga caagattcct "tataaaat "gcaaggct 2100 
tattctgtgt atgatgaaga gattggttat tgccagggcc agtcatttct tgctgctgtg zxou 
c^ccttctcc atatgcctga agaacaggca ttcagtgttc tggtcaagat catgtttgac 2220. 
tatcggctca gggaactttt caagcaaaac ttcgaagatt tgcattgcaa attttaccag 2280 
C?5glgcg" ?cltgcagga atacattcct gacctgtaca accacttcct ggatataagc 2340 

. ct?gaag?ac " acatgtalgc^tcccagtgS, tttcttactc " ttca ^^l"^"t *J°J 
ctclacatgg tcttccatat catcgacctg cttttatgtg agggaataag. tgttattttt 2460 
aa?g?cgc« ttggattatt aaagacttcg aaagatgacc tgctgttgac agactttgaa 2520 

.gStgccJtga agltctttag ggttcagctt cctaagagat accgctcaga agaaaatgca 2580 
alalaactaa tggaattagc ctgcaacatg aagattagtc agaagaagtt gaaaaaatac .2640 
•gagaaagaata?Lcacca t gagggaacag caggcccagc aagaagaccc "tcgagcga 2700 
???gagcggg agaataggcg tctacaagaa gctaacatga ggttggaaca ggaaaacgat 2760 
gac??ag«c a?gagctggt gaccagcaag attgcactac ggaaggacct W«t"cgct 2820 
gaggaaaagg cagatgctct gaataaggag ctgctgatga ccaaacagaa ^"tgat 2880 
gcagaagalg agaaaagacg gctggaagaa gagtctgctc agt aaaaga ^atgtgccgt 2940 
Igggaactcg- acaaggcaga atctgagatt aaaaaaaaca gttctatcat tggtgactat 3000 
Saglagatt? gttctcagtt gagtgaaaga ttggagaagc agcagacagc caataaggtg 3060 
gaaattgaga aaattcggca aaaagtggat gactgtgagc S^gccggga atttttcaac 3120 
aaagaagggc gtgtaaaagg cataagctca accaaggagg ttttagatga ggacacggat 3180 
gaSgagSIg agacgctcaa gaaccagctg agagaaatgg agctagaact ^acagacc 3240 
aaactccagc tggtggaggc cgagtgtaag atacaggact tggaacacca tttagggctt 3300 
g??ctcaa?g aggtglaggc agccaagaag acgtggttta accgaacact gagctccata 3360 
laaacaaSaa ccggggttca agggaaagag acttgctgag agcagctgcc gcctcccgac 3420 

■ acIttclgat aalSIacac c???tg t tgc cttctttggc cagatgtgtg at tctgtgap 3480 
ttgtcccagg- accagaatgt acctaagtca gatccataga cgcatgttgg taggtcactg 3540 
aalcagagct tgtgaagcag gcaacctctg gggtaagact actgatacta acaggcctgc ; 3.600 

\ Sgctcagcc glcgctltgg acactctaga aatcactcct cagtgtgacc tcccaggcct 3660 
cttccccgtg tacgtcaaca cctcacccag .c. . > 
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I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This Internal 



ional Search Report has not been established in respect' of certain claims under Article 17(2)(a) for the following reasons: 



1 W beetle they relate to subject matter not required to be searched by this Authority, namely: . " . 7 * ^ ; " ' 

Although claim 18 is directed to a method of treatment of the human/animal _ 
body^the search has been carried out and based on the alleged, effects of the 
compound/composition. . , . -:\ 

2 - E t°hev relate to parted 

an extent that no meaningful International Search can be carried out. specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 V ; 



D because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as folto^ 

; see additional sheet 

1 I"""! As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
" L — I searchable claims. 

2. I I As all searchable claims could be searched without effort justifying an additional fee. this Authority did not invite payment 
of any additional fee. ■ : 

3 HI As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
L_l covers ionly those claims for which fees were paid.. specifically claims Nos.: 



a. m No recuired additional search fees were timely paid by the applicant. Consequently, this International Search Report i 
f-X- 1 rp.tricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



restricted to the invention 

Claims 1-28 (all partially) 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

| | No protest accompanied the payment of additional search fees. 
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Claims 20, 21, 23 and 24 refer to agonists and antagonists of the 
polypeptides without giving a true technical characterization. Moreover, 
no such specific compounds are defined in the application. In 
consequence, the scope of said claims is ambiguous and vague, and their 
subject-matter is not sufficiently disclosed and supported (Art. 5 and 6 
PCT). No search can be carried out for such purely speculative claims 
whose wording is, in fact, a mere recitation of the results to be 
achieved. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an -v 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a , ' 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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This International Searching Authority found multiple (groups of), 
inventions in this international application, as follows: 

; Invention 1: Claims 1-28 (all partially) : , : ■; X ^ 

An isolated polypeptide as represented by SEQ.ID.l, and • 
variants, biologically active fragments and immunologic - 
fragments thereof, an isolated polynucleotide encoding said . 
polypeptide, variants and fragments; a cell transformed with 
said polynucleotide; a transgenic organism comprising said 
polynucleotide; a method for producing said, polypeptide, 
variants and fragments; and isolated antibody specific for 
said polypeptide; methods for detecting said, polynucleotide 
or said polypeptide; methods for/ detecting agonists, 
antogonists or modulators of the activity of said 
polypeptide; pharmaceutical compositions comprising said 
polypeptide, agonist or antagonist; a method of screening 
for compounds that specifically bind to said polypeptide; a 
method for screening a compound for effectiveness in 
altering expression of said polynucleotide; and a method for 
assessing toxicity of a test compound comprising a 
hybridization probe derived from said polynucleotide. 



Inventions 2-20: Claims 1-28 (all partially) 

Same as subject 1 but limited to one polypeptide sequence 
selected from SEQ.IDs. 2-12, 14-20 and 22, wherein invention 
2 is limited to SEQ.ID.2, invention 3 is limited to 

SEQ.ID.3, invention 12 is limited to SEQ.ID. 12, 

invention 13 is limited, to SEQ.Ib-.14, ..... invention 19 is 
limited to SEQ.ID.20, and invention 20 is limited to 
SEQ. 10.22. 
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